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Human Chitinase Alpha and Chitinase Alpha-2 
Background of the Invention 
Field of the Invention 

The present invention relates to novel human chitinase genes - chitinase 
alpha and chitinase alpha-2. More particularly, the invention relates to nucleic 
acid sequences encoding mature chitinase alpha or mature chitinase alpha-2 or 
fragments thereof, the use of such polypeptides for treating disease, as well as 
processes for producing these polypeptides recombinantly. 

Related Art 

The generation or destruction of tissue requires constant reorganization 
and restructuring of the extracellular matrix (ECM) components, including 
interstitial collagens, basement membrane collagen, fibronectin, laminin, aggrecan, 
and various proteoglycans. Heinegard and Oldberg, FASEB J. 5:2042-2051 
(1989). Normal types of remodeling processes include embryonic development, 
post-partum involution of the uterus, ovulation, wound healing, and bone and 
growth plate remodeling. Woessner et aL, Steroids 54: 491-499 (1989); Weeks 
et a/., Biochim. Biophys. Acta 445:205-214 (1976); Lepage and Gache, EMBO 
J. 9:3003-3012 (1990); Wride and Sanders, Dev-Dyn. 7PS(3):225-239 (1993). 
Similar processes also occur in disease states such as joint destruction in 
rheumatoid and osteoarthritis, periodontia, and tumor cell metastasis. Thompson 
and Oegema, J. Bone Joint Surg. 67:407-416 (1979); Reynolds et aL % Adv-Dent- 
Res. 5(2):312-319 (1994). One example of these processes is the migration of 
macrophages to the ate of inflammation as in the case of synovial tissue in 
rheumatoid arthritis. Cutolo etaL, Clin. andExper. Rheum. 77:331-339 (1993). 
The ECM components are regulated, in both normal and disease states, by various 
exogenous and endogenous factors. For example, in tumor formation, the 



differentiation state of the cell can increase the rate of degradation of the ECM 
Benya, Pathol ImmunopathoL Res. 7:51-54 (1988). Likewise, the presence of 
metalloproteinases or their inhibitors can alter the composition of the ECM. An 
imbalance of metalloproteinases and tissue inhibitors of matrix metalloproteinases 
(TTMP) has been shown to contribute to the pathogenesis of osteoarthritis. Dean 
et aL 9 J. Clin Invest 54:678-685 (1989). Cytokines, growth factors, and the 
extracellular environment can all contribute to the alteration of the ECM. Tyler, 
Biochem J. 227:869-878 (1985); Dinarello, Sem. Immunol 4:133-145 (1992); 
McConnell etaLJ. Cell Biol 705:1087-1098.(1987). 

The growth of cartilage and bone is actualized by cells such as articular 
chondrocytes and osteoblasts. The main function of these cells in immature tissue 
is the deposition and remodeling of the cartilage or bone matrix. In adult tissue, 
these cells maintain this matrix in order to ensure its proper function. In both 
case, this encompasses secretion of proteins involved in the turnover of the ECM. 

A major species of protein secreted by these cells and involved in the 
turnover of the ECM are the metalloproteinases. Woessner, FASEB J. 5:214- 
2154 (1991). As will be discussed below, new types of secretory glycoproteins 
have also been identified in human cartilage, osteoblasts, synovial cells, sheep and 
bovine oviduct and mammary cells, and macrophages. Nyrikos and Golds, 
Biochem. J. 265:265-268 (1990); Hakala et at, J. Biol C/rem.2<W(34):25803- 
25810 (1993); Johansen et al 9 J. Bone and Min. Res. 7(5):501-511 (1992); 
Rejman and Hurley, Biochem. Biophys. Res. Commun. 750:329-334 (1988); 
DeSouza and Murray, Endocrinology 755(6):2485-2496 (1995); Hollak et al y 
J. Clin. Invest 93:1288-1292 (1995); Arias et of., Biol of Reproduction 57:685- 
694 (1994). These proteins all share regions of significant homology to bacterial 
and fungal chitinases, and therefore are referred to herein as "chitinase-like 
proteins." They bear a subtle similarity to lysozymes from mammals and function 
as endoglycosidases, with a specificity for N-acetyl-glucosamine linkages. 
However, these types of chitin-like structures (homopolymers of N-acetyl- 
glucosamine) were not normally encountered in mammalian tissue. 
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The human cartilage glycoprotein, HC gp-39, is a mammalian chitinase- 
like protein with an apparent molecular weight of approximately 39 kDa that is 
secreted by both articular chondrocytes and synovial fibroblasts. Nyrikos and 
Golds, supra; Hakala et al y supra. This protein has been described as a marker 
for joint injury, appearing in the blood and synovial fluid from patients diagnosed 
with rheumatoid arthritis. Johansen et aL y British J. of Rheumatology 32:949-955 
(1993). The gene encoding this protein has been cloned. Hakala et aL 9 supra. 
Neither the protein nor mRNA for HC gp-39 was detectable in normal newborn 
or adult human articular cartilage obtained at surgery, but was detectable both in 
synovial specimens and in cartilage obtained from patients with rheumatoid 
arthritis. Hakala etaL, supra. The observations of Hakala et al suggest that the 
expression of HC-gp-3 9 may be related to a response of these cells to an altered 
tissue environment. 

The protein YKL-40 has also been identified as one of the major secretory 
products of cultured human osteoblastic cells (osteo carcinoma cell line MG-63) 
expressed in response to l 9 25-dihydroxyvitaminD3 stimulation. Johansen era/*, 
J. Bone and Mirt Res. 7(5):501-511 (1992); Johansen et aL 9 British J. of 
Rheumatology 52:949-955 (1993). The N-terminal portion of YKL-40 was 
sequenced and found to be identical to HC gp-39. Upon further sequencing, 
YKL-40 and HC gp-39 were found to be identical. 

Chitotriosidase is an enzyme that has been identified as a member of the 
chitinase protein family. Renkema et aL y J. Biol Chem. 27(7:2198-2202 (1995); 
Hollak et al. 9 J. Clin, Invest 95:1288-1292 (1994). This protein also has an 
apparent molecular weight of 39 kDa and shares N-terminal homologies with HC 
gp-39, the bovine mammary protein, and several bacterial chitinases. Activity of 
this enzyme was originally detected from cells of patients afflicted with Gaucher 
Disease (GD). Gaucher Disease is an inherited deficiency in the activity of 
glucocerebrosidase, lysosomal hydrolase. The defect results in an accumulation 
of glucosylceramide (glucocerebroside) in the iysosomes of macrophages. 
Accumulation of lipid-laden macrophages results in hepatosplenomegaly, bone 
lesions, and neurological anomalies. After morphological differentiation f 
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monocytes in the macrophages in culture, the cells begin to produce secrete 
increasing amounts of chitotriosidase. This increase was, on average, 600 times 
greater in symptomatic GD patients than in healthy volunteers. Holiak et al, 
supra. The elevation in chitotriosidase activity can be effectively reduced, 
however, upon initiation of enzyme supplementation therapy. 

Unlike other members of the family of proteins homologous to chitinase, 
chitotriosidase does have chitinolytic activity. Renkema et al, supra. Like the 
bacterial enzyme; it has ability to degrade chitin azure, a polymer of betal-4-linked 
N-aeetylglucosamme moieties. Chitotriosidase, along with HC gp-39, is beiieved 
to be involved in tissue remodeling in the mammalian cell, and thus may serve as 
useful tools in the development of therapeutics and diagnostics for tissue 
remodeling disorders. For example, the enzyme can serve as a valuable diagnostic 
marker for Gaucher disease, and may be useful in monitoring the efficacy of 
therapy. 

Chitin is a major structural component of the cell walls of fungal 
pathogens. Chhinases are enzymes found in a variety of species, including plants, 
bacteria, fungi, insects, and fish (Flach etal, Expehentia 45:701-716 (1992)), that 
hydrolyze chitin or chitin-like substrates, such as 4-methylumbelliferyl 
chitotrioside. It is well documented that plant chhinases are potent inhibitors of 
fungal growth. Bro&zert etal 3 Physiol Mol Plant Pathol 35:319-331 (1988); 
Roberts et aL 9 J. Gen. Microbiol 73^:169-174(1988). Thus, plant chitinases may 
protect plants against fungi and other pathogens. It has often been suggested that 
humans and other mammals do not possess an analogous chitinase. Raghavan et 
a/., Infect. Immun. 62:1901-1908 (1994); Huber etcd. y Proc. Natl Acad ScL 
USA £«:2807-2810 (1991). This may not be the case. 

Hollak et al t discussed supra, has now observed that chitotriosidase is 
secreted in large quantities by activated human macrophages and has been 
reported to be markedly elevated in the plasma of patients suffering from Gaucher 
disease. Boot et a/., J. Biol Chem. 270:26252-26256 (1995), has recently 
reported on the cloning of chitotriosidase. This report also suggests that this 
enzyme may play a role in the degradation of chitin-containing pathogens. Escott 



and Adams, Infect Immun. 63:4770-4773 (1995), report on the detection of a 
chitinase in human serum and leukocytes that cleaves colloidal [ 3 H] chitin They 
report that the dutinase activity detected in human granulocytes may reflect that 
fact that the enzyme in humans may play a protective role against fungi and other 
chitin-containing microbial pathogens. This role is conceivable in view of the 
well-documented role of plant chitinases as potent inhibitors of fungal growth. 

Accordingly, there is a need in the art for the identification of novel human 
chitinase enzymes, which along with chitotriosidase and HC gp-39, can serve as 
useful tools in the development of therapeutics and diagnostics for tissue 
remodeling disorders or as anti-fungal agents. 

Summary of the Invention 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding a human chitinase alpha polypeptide having the amino 
acid sequence in Figure 1 [SEQ ID NO:2] or Figure 2 [SEQ ID NO:4] or the 
amino acid sequence encoded by the cDNA clone deposited in a bacterial host as 
ATCC Deposit Number 97529 on May 3, 1996. The present invention further 
provides isolated nucleic acid molecules comprising a polynucleotide encoding a 
human chitinase alpha-2 polypeptide having the amino acid sequence in Figure 5 
[SEQ ID NO: 12]. 

The nucleotide sequence determined by sequencing the deposited chitinase 
alpha cDNA clone, which is shown in Figure 1 [SEQ ID NO: 1], contains an open 
reading frame encoding a polypeptide of about 421 amino acid residues including 
an initiation codon at nucleotide positions 82-84, a leader sequence of about 26 
amino acid residues and a deduced molecular weight of about 44 kDa. The 395 
amino acids of the predicted mature chitinase alpha protein is shown in Figure 1 
(last 395 residues) and in SEQ ID NO:2 (from amino acid residue 27 to residue 
421). 

The nucleotide sequence shown in Figure 5 [SEQ ID NO: 1 1 ] contains an 
open reading frame encoding a polypeptide of about 385 amino acid residues 



including an initiation cod n at nucleotide positions 27-29 and a leader sequence 
of about 20 amino acid residues. The 365 amino acids of the predicted mature 
chitinase alpha-2 protein is shown in Figure 5 (last 365 residues) and in SEQ ID 
NO: 12 (from amino acid residue 21 to residue 385). As discussed in more detail 
below, the chitinase alpha-2 gene is a shorter variant of the chitinase alpha gene. 

Thus, one aspect of the invention provides isolated nucleic acid molecules 
comprising a polynucleotide having a nucleotide sequence selected from the group 
consisting of: (a) a nucleotide sequence encoding the chitinase alpha polypeptide 
having the complete amino acid sequence shown in Figure 1 [SEQ ID NO:2] or 
Figure 2 [SEQ ID NO:4]; (b) a nucleotide sequence encoding the mature 
chitinase alpha polypeptide having the amino acid sequence at positions 27-421 
in Figure 1 [SEQ ID NO:2] or Figure 2 [SEQ ID NO:4j; (c) a nucleotide 
sequence encoding the chitinase alpha polypeptide having the complete amino acid 
sequence encoded by the cDNA clone contained in ATCC Deposit No. 97529; "(d) 
a nucleotide sequence encoding the mature chitinase alpha polypeptide having the 
amino acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
97529; (e) a nucleotide sequence encoding the chitinase alpha-2 polypeptide 
having the complete amino acid sequence shown in Figure 5 [SEQ ID NO: 12]; 
(f) a nucleotide sequence encoding the mature chitinase alpha-2 polypeptide 
having the amino add sequence at positions 21-385 in Figure 5 [SEQ ID NO: 12]; 
and (g) a nucleotide sequence complementary to any of the nucleotide sequences 
in (a), (b), (c), (d), (e) or (0 above. Preferably, the nucleic acid molecule will 
encode the mature polypeptide as shown in Figure 1 [SEQ ID NO:2], Figure 2 
[SEQ ED NO:4], Figure 5 [SEQ ID NO: 12] or as encoded by the above-described 
deposited cDNA. 

Further embodiments of the invention include isolated nucleic acid 
molecules that comprise a polynucleotide having a nucleotide sequence at least 
90% identical, and more preferably at least 95%, 97%, 98% or 99% identical, to 
any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), or (g) above, or a 
polynucleotide which hybridizes under stringent hybridization conditions to a 
polynucleotide having a nucleotide sequence identical to a nucleotide sequence in 
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(a), (b), (c), (d), (e), (0 or (g) above. The polynucleotide which hybrizes does not 
hybridize under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence consisting of only A residues or of only T residues. An 
additonal nucleic acid embodiment of the invention relates to an isolated nucleic 
acid molecule comprising a polynucleotide which encodes the amino acid 
sequence of an epitope-bearing portion of an chitinase alpha polypeptide having 
an amino acid sequence in (a), (b), (c), (d), (e), or (0 above. 

The present invention also relates to recombinant vectors which includ 
the isolated nucleic acid molecules of the present invention and host cells 
containing the recombinant vectors, as well as methods of making such vectors 
and host cells and for using them for production of chitinase alpha- and chitinase 
alpha-2- polypeptides or peptides by recombinant techniques. 

The invention further provides an isolated chitinase alpha or chitinase 
alpha-2 polypeptide having amino acid sequence selected from the group 
consisting of: (a) the amino acid sequence of the chitinase alpha polypeptide 
having the complete 421 amino acid sequence including the leader sequence 
shown in Figure 1 [SEQ ID NO:2] or Figure 2 [SEQ ID NO:4]; (b) the amino 
acid sequence of the mature chitinase alpha polypeptide (without the leader) 
having the amino acid sequence at positions 27-421 in Figure 1 [SEQ ID NO:2] 
or Figure 2 [SEQ ID NO:4]; (c) the amino acid sequence of the chitinase alpha 
polypeptide having the complete amino acid sequence including the leader 
encoded by the cDNA clone contained in ATCC Deposit No. 97529; (d) the 
amino acid sequence of the mature chitinase alpha polypeptide having the amino 
acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 
97529; (e) the amino acid sequence of the chitinase alpha-2 polypeptide having 
the complete 385 amino acid sequence including the leader sequence shown in 
Figure 5 [SEQ ID NO: 12]; and (0 the amino acid sequence of the mature chitinase 
alpha-2 polypeptide (without the leader) having the amino acid sequence at 
positions 21-385 in Figure 5 [SEQ ID NO: 12]. 

The polypeptides of the present invention also include polypeptides having 
an amino acid sequence with at least 90% similarity, more preferably at least 95% 
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similarity to those described in (a), (b), (c), (d), (e), or (0 above, as well as 
polypeptides having an amino acid sequence at least 80% identical, more 
preferably at least 90% identical, and still more preferably 95%, 97%, 98% or 
99% identical to those above. 

An additional embodiment of this aspect of the invention relates to a 
peptide or polypeptide which has the amino acid sequence of an epitope-bearing 
portion of a chhinase alpha polypeptide or a chitinase alpha-2 polypeptide having 
an amino acid sequence described in (a), (b), (c), (d), (e), or (f) above. Peptides 
or polypeptides having the amino acid sequence of an epitope-bearing portion of 
a chitinase alpha polypeptide or a chitinase alpha-2 polypeptide of the invention 
include portions of such polypeptides with at least six or seven, preferably at least 
nine, and more preferably at least about 30 amino acids to about SO amino acids, 
although epitope-bearing polypeptides of any length up to and including the entire 
amino acid sequence of a polypeptide of the invention described above also are 
included in the invention. In another embodiment the invention provides an 
isolated antibody that binds specifically to a chitinase alpha polypeptide or 
chitinase alpha-2 polypeptide having an amino acid sequence described in (a), (b), 
(c), (d), (e), or (f) above. 

Chitinase alpha and chitinase alpha-2, which are secreted, and which have 
significant homology to human cartilage glycoprotein-39 (HC gp-39) and to 
chitotriosidase, are believed by the present inventors to be involved In tissue 
remodeling. Therefore, chitinase alpha and chitinase alpha-2 are useful targets for 
the inhibition of tissue remodeling disease processes, such as rheumatoid arthritis. 
Accordingly, the invention is related to a method of treating an individual in need 
of a decreased level of chitinase alpha or chitinase alpha-2 activity in the body, 
which involves administering to such an individual a composition comprising an 
antagonist to chitinase alpha and/or chitinase alpha-2 such as anti-chitinase alpha- 
or anti-chitinase alpha-2-antibodies. 

It is also believed by the inventors that the chitinase alpha polypeptide and 
the chitinase alpha-2 polypeptide of the invention can be used to treat fungal 
infection, as chitin is a major component of the fungal cell wall, and chitnases 



cleave this polymer. Thus, an additional aspect of the invention is related to a 
method for treating fungal infection, which involves administering to such an 
indivdual a composition comprising a chitinase alpha polypeptide or a chitinase 
alpha-2 polypeptide of the invention. 

Further, for a number of disorders, it is believed by the inventors that 
significantly higher or lower levels of chitinase alpha and/or chitinase alpha-2 gene 
expression can be detected in bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to 
a "standard" chitinase alpha or chitinase alpha-2 gene expression level, i.e., the 
chitinase alpha or chitinase alpha-2 gene expression level in bodily fluids from an 
individual not having the disorder. Thus, the invention provides a diagnostic 
method useful during diagnosis of a disorder related to an abnormal level of 
chitinase alpha or chitinase alpha-2 gene expression, which involves (a) assaying 
chitinase alpha or chitinase alpha-2 gene expression level in cells or body fluid of 
that individual; (b) comparing that chitinase alpha or chitinase alpha-2 gene 
expression level with a standard chitinase alpha or chitinase alpha-2 gene 
expression level, whereby an increase or decrease in the assayed chitinase alpha 
or chitinase alpha-2 gene expression level compared to the standard expression 
level is indicative of a disorder. 

Brief Description of the Figures 

Figure 1 provides the coding sequence (SEQ ID NO: 1) and deduced 
amino acid sequence (SEQ ID NO;2) of the polypeptide chitinase alpha. 

Figure 2 provides the coding sequence (SEQ ID NO:3) and deduced 
amino acid sequence (SEQ ID NO;4) of a chitinase alpha variant. The nucleotide 
sequence is modified from Figure 1 at positions 607-609. 

Figure 3 shows regions of similarity between the amino acid sequences of 
chitinase alpha (HPMFW51) (SEQ ID NO:4), HC gp-39 (SEQ ID NO:5), and 
chitotriosidase (SEQ ID NO:6). 
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Figure 4 shows an analysis of the chitinase alpha amino acid sequence. 
In particular, the Kyte-Doolittle hydrophilicity plot and the Jameson-Wolf 
antigenic inex are provided. Amino acid residues Leu38-Ala51 ; Asn66-Phe85; 
Asnll7-Lysl27; Metl45-Leul53; Asnl65-Aspl72; Tyrl77-Hisl85; Phe245- 
Ile254; Gly256-Tyr274; Lys286-Tyr293; Gly366-Lys376; and Asp394-Pro407 
in Figure 1 are highly antigenic regions of the chitinase alpha protein. 

Figure 5 provides the coding sequence (SEQ ID NO:l 1) and deduced 
amino acid sequence (SEQ ID NO: 12) of a shortened chitinase alpha variant - 
chitinase alpha-2. 

Figures 6(a-d) provide a comparison of the nucleotide (a-c) and deduced 
amino acid (d) sequences of chitinase alpha-2 and chitinase alpha. 

Detailed Description of the Invention 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding the chitinase alpha protein having the amino- acid 
sequence shown in Figure 1 [SEQ ID NO:2] which was determined by 
sequencing a cloned cDNA. The nucleotide sequence shown in Figure 1 [SEQ ID 
NO:l) was obtained by sequencing the hpmfwSl cDNA clone encoding a 
chitinase alpha polypeptide, which was deposited on May 3 f 1996 at the 
American Type Culture Collection, 12301 Park Lawn Drive, Rockville, Maryland 
20852, and given accession number 97529. The deposited clone is contained in 
the pBluescript SK(-) plasmid (Stratagene, LaJolla, CA). 

The present invention further provides isolated nucleic acid molecules 
comprising a polynucleotide encoding the chitinase alpha-2 protein having the 
amino acid sequence shown in Figure 5 [SEQ ID NO: 12]. 
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Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DNA molecule herein were determined using an automated DNA 
sequencer (such as the Model 373 from Applied Biosystems, Inc.), and all amino 
S acid sequences of polypeptides encoded by DNA molecules determined herein 

were predicted by translation of a DNA sequence determined as above. 
Therefore, as is known in the art for any DNA sequence determined by this 
automated approach, any nucleotide sequence determined herein may contain a 
some errors. Nucleotide sequences determined by automation are typically at least 

10 about 90% identical, more typically at least about 95% to at least about 99.9% 

identical to the actual nucleotide sequence of the sequenced DNA molecule. The 
actual sequence can be more precisely determined by other approaches including 
manual DNA sequencing methods well known in the art. As is also known in the 
art, a single insertion or deletion in a determined nucleotide sequence compared 

IS to the actual sequence will cause a frame shift in translation of the nucleotide 

sequence such that the predicted amino acid sequence encoded by a determined 
nucleotide sequence will be completely different from the amino acid sequence 
actually encoded by the sequenced DNA molecule, beginning at the point of such 
an insertion or deletion. 

20 Unless otherwise indicated, each "nucleotide sequence" set forth herein is 

presented as a sequence of deoxyribonucleotides (abbreviated A, G f C and T). 
However, by "nucleotide sequence" of a nucleic acid molecule or polynucleotide 
is intended, for a DNA molecule or polynucleotide, a sequence of 
deoxyribonucleotides, and for an RNA molecule or polynucleotide, the 

25 corresponding sequence of ribonucleotides (A, G, C and U) where each thymidine 

deoxynucleotide (T) in the specified deoxynucleotide sequence in is replaced by 
the ribonucleotide uridine (U). For instance, reference to an RNA molecule 
having the sequence of SEQ ID NO:l set forth using deoxyribonucleotide 
abbreviations is intended to indicate an RNA molecule having a sequence in 

30 which each deoxynucleotide A, G or C of SEQ ID NO: 1 has been replaced by the 
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corresponding ribonucleotide A, G or C, and each deoxynucleotide T has been 
replaced by a ribonucleotide U. 

Using the information provided herein, such as the nucleotide sequence in 
Figure 1 [SEQ ID NO: 1], Figure 2 [SEQ ED NO:3], or Figure 5 [SEQ ID NO: 1 1], 
a nucleic acid molecule of the present invention encoding a chitinase alpha or 
chitinase alpha-2 polypeptide may be obtained using standard cloning and 
screening procedures, such as those for cloning cDNAs using mRNA as starting 
material. Illustrative of the invention, the nucleic acid molecule described in 
Figure 1 [SEQ ID NO: 1 ] was discovered in a cDNA library derived from human 
placenta tissue. The determined nucleotide sequence of the chitinase alpha cDN A 
of Figure 1 contains an Open reading frame encoding a protein of about 421 amino 
acid residues with an initiation codon at positions 82*84 of the nucleotide 
sequence shown in Figure 1 [SEQ ID NO. 1], and a predicted leader sequence of 
about 26 amino acid residues, and a deduced molecular weight of about 44 kDa 
without the leader and about 46.6 kDa with the leader. The amino acid sequence 
of the predicted mature chitinase alpha protein is also shown in Figure 1 [SEQ ID 
NO:2] from about amino acid residue 27 to about residue 421. The chitinase 
alpha protein shown in Figure 1 [SEQ ID NO: 2] is about 52% identical and about 
68% similar to human gp-39, and about 52% identical and about 70% similar to 
chitotriosidase (Figure 3). 

The determined nucleotide sequence of the chitinase alpha-2 cDN A of 
Figure 5 contains an open reading frame encoding a protein of about 385 amino 
acid residues with an initiation codon at positions 27-29 of the nucleotide 
sequence shown in Figure 5 [SEQ ID NO: 1 1], and a predicted leader sequence 
of about 20 amino acid residues. The amino acid sequence of the predicted 
mature chitinase alpha-2 protein is also shown in Figure 5 [SEQ ID NO: 12] from 
about amino add residue 21 to about residue 385. The chitinase alpha-2 
nucleotide sequence shown in Figure 5 [SEQ ID NO:l 1] is about 99% identical 
and about 99 % similar to the chitinase alpha nucleotide sequence shown in Figure 
1 [SEQ ID NO: 1]. 
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As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors discussed above, as well as the variability of cleavage sites for 
leaders in different known proteins, the actual chitinase alpha polypeptide encoded 
by the deposited cDNA comprises about 421 amino acids, but may be anywhere 
5 in the range of 410-432 amino adds; and the actual leader sequence of this protein 

is about 26 amino acids, but may be anywhere in the range of about 21 to about 
31 amino acids. Similarly, the actual chitinase alpha-2 polypeptide comprises 
about 385 amino adds, but may be anywhere in the range of 375-395 amino adds; 
and the actual leader sequence of this protein is about 20 amino acids, but may be 

10 .... anywhere in the range of about 16 to about 25 amino adds. 

As indicated, nucleic acid molecules of the present invention may be in the 
form of RNA, such as mRNA, or in the form of DNA, including, for instance, 
cDNA and genomic DNA obtained by cloning or produced synthetically. The 
DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA 

15 may be the coding strand, also known as the sense strand, or it may be the non- 

coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic add molecule(s) is intended a nucleic acidimolecule, 
DNA or RNA, which has been removed from its native environment For example, 
recombinant DNA molecules contained in a vector are considered isolated for the 

20 purposes of the present invention. Further examples of isolated DNA molecules 

include recombinant DNA molecules maintained in heterologous host cells or 
purified (partially or substantially) DNA molecules in solution. Isolated RNA 
molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the 
present invention. Isolated nucleic acid molecules according to the present 

25 invention further include such molecules produced synthetically. 

Isolated nucldc add molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) with an initiation codon at 
positions 82-84 of the nudeotide sequence shown in Figure 1 [SEQ ID NO: 1] or 
Figure 2 [SEQ ID NO:3]; DNA molecules comprising the coding sequence for the 

30 mature chitinase alpha protein shown in Figure 1 (last 395 amino adds) [SEQ ID 

NO:2] or Figure 2 [SEQ ID NO:4] (last 395 amino adds); DNA molecules 
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comprising an open reading frame (ORF) with an initiati n codon at positions 27- 
29 of the nucleotide sequence shown in Figure S [SEQ ID NO: II]; DNA 
molecules comprising the coding sequence for the mature chitinase alpha-2 protein 
shown in Figure 5 (last 365 amino acids) [SEQ ID NO: 12]; and DNA molecules 
which comprise a sequence substantially different from those described above but 
which, due to the degeneracy of the genetic code, still encode the chitinase alpha 
protein. Of course, the genetic code is well known in the art. Thus, it would be 
routine for one skilled in the art to generate the degenerate variants described 
above. 

In another aspect, the invention provides isolated nucleic acid molecules 
encoding the chitinase alpha polypeptide having an amino acid sequence encoded 
by the cDNA clone contained in the plasrnid deposited as ATCC Deposit No 
97529 on May 3, 1996, Preferably, this nucleic acid molecule will encode the 
mature polypeptide encoded by the above-described deposited cDNA clone. The 
invention further provides an isolated nucleic acid molecule having the nucleotide 
sequence shown in Figure 1 [SEQ ID NO:l] Or Figure 2 [SEQ ID NO:3] or 
Figure 5 [SEQ ID NO:l 1] or the nucleotide sequence of the chitinase alpha cDNA 
contained in the above-described deposited clone, or nucleic acid molecule having 
a sequence complementary to one of the above sequences. Such isolated 
molecules, particularly DNA molecules, are useful as probes for gene mapping by 
in situ hybridization with chromosomes and for detecting expression of the 
chitinase alpha gene in human tissue, for instance, by Northern blot analysis. 

The present invention is further directed to fragments of the isolated 
nucleic add molecules described herein. By a fragment of an isolated nucleic acid 
molecule having the nucleotide sequence of the deposited cDNA or the nucleotide 
sequence shown in Figure 1 (SEQ ID NO: 1), Figure 2 (SEQ ID NO:3), or Figure 
S (SEQ ID NO: 11), is intended fragments at least about 15 nt, and more 
preferably at least about 20 nt, still more preferably at least about 30 nt, and even 
more preferably, at least about 40 nt in length which are useful as diagnostic 
probes and primers as discussed herein. Of course, larger fragments 50-1400 nt 
in length are also useful according to the present invention as are fragments 
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corresponding to most, if not all, of the nucleotide sequence of the deposited 
cDNA or as shown in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:3), or 
Figure 5 (SEQ ID NO:l 1). By a fragment at least 20 nt in length, for example, 
is intended fragments which include 20 or more contiguous bases from the 
S nucleotide sequence of the deposited cDNA or the nucleotide sequence as shown 

in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:3), or Figure 5 (SEQ ID 
NO:l 1). Since the gene has been deposited and the nucleotide sequences shown 
in Figure 1 (SEQ ID NO:l), Figure 2 (SEQ ID NO:3) and Figure 5 (SEQ ID 
NO: 11) are provided, generating such DNA fragments would be routine to the 

10 _ ? skilled artisan. For example, restriction endonuclease cleavage or shearing by 
v sonication could easily be used to generate fragments of various sizes. 
Alternatively, such fragments could be generated synthetically. 

Preferred nucleic acid fragments of the present invention include nucleic 
acid molecules encoding epitope-bearing portions of the chitinase alpha and 

IS chitinase alpha-2 proteins. In particular, such nucleic acid fragments of the 

present invention include nucleic add molecules encoding: a polypeptide 
comprising amino acid residues from about 38 to about 51 in Figure t j( SEQ ID 
NO:2); a polypeptide comprising amino acid residues from about 66 to about 85 
in Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues from 

20 about 117 to about 127 in Figure 1 (SEQ ID NO:2); a polypeptide comprising 

amino acid residues from about 145 to about 153 in Figure 1 (SEQ ID NO:2); a 
polypeptide comprising amino add residues from about 165 to about 172 in 
Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues from 
about 177 to about 185 in Figure 1 (SEQ ID NO:2); a polypeptide comprising 

25 amino acid residues from about 245 to about 254 in Figure 1 (SEQ ID NO:2); a 

polypeptide comprising amino acid residues from about 256 t9 about 274 in 
Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues from 
about 286 to about 293 in Figure 1 (SEQ ID NO:2); a polypeptide comprising 
amino acid residues from about 366 to about 376 in Figure 1 (SEQ ID NO:2); and 

30 a polypeptide comprising amino acid residues from about 394 to about 407 in 

Figure I (SEQ ID NO:2). 
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The inventors have determined that the above polypeptide fragments are 
antigenic regions of the chitinase alpha protein. Methods for determining other 
such epitope-bearing portions of the chitinase alpha protein are described in detail 
below. 

In another aspect, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of the polynucleotide in a nucleic acid molecule of the 
invention described above, for instance, the cDNA clone contained in ATCC 
Deposit 97S29. By "stringent hybridization conditions" is intended overnight 
incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150 mM 
NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 ng/ml denatured, sheared salmon sperm 
DNA, followed by washing the filters in O.lx SSC at about 65°C. By a 
polynucleotide which hybridizes to a "portion" of a polynucleotide is intended a 
polynucleotide (either DNA or RNA) hybridizing to at least about 15 nucleotides 
(nt), and more preferably at least about 20 nt, still more preferably at least about 
30 nt, and even more preferably about 30-70 nt of the reference polynucleotide. 
These are useful as diagnostic probes and primers as discussed above and in more 
detail below. 

Of course, polynucleotides hybridizing to a larger portion of the reference 
polynucleotide (e.g., the deposited cDNA clone), for instance, a portion 50-750 
nt in length, or even to the entire length of the reference polynucleotide, also 
useful as probes according to the present invention, as are polynucleotides 
corresponding to most, if not all, of the nucleotide sequence of the deposited 
cDNA or the nucleotide sequence as shown in Figure 1 [SEQ ID NO: 1 ], Figure 
2 [SEQ ID NO:3] or Figure 5 [SEQ ID NO:l 1]. By a portion of a polynucleotide 
of "at least 20 nt in length," for example, is intended 20 or more contiguous 
nucleotides from the nucleotide sequence of the reference polynucleotide, (e.g., 
the deposited cDNA or the nucleotide sequence as shown in Figure 1 [SEQ ID 
NO:l], Figure 2 [SEQ ID NO:3] or Figure 5 [SEQ ID NO:l 1]). As indicated, 
such portions are useful diagnosticaliy either as a probe according to conventional 
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DNA hybridization techniques or as primers for amplification of a target sequence 
by the polymerase chain reaction (PCR), as desribed, for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J. et al^ eds., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989), the 
entire disclosure of which is hereby incorporated herein by reference. 

Since a chitinase alpha cDNA done has been deposited and its determined 
nucleotide sequence is provided in Figure 1 [SEQ ID NO:l] and the nucleotide 
sequence of a variant is provided in Figure 2 {SEQ ID NO:3], generating 
polynucleotides which hybridize to a portion of the chitinase alpha cDNA 
molecule would be routine to the skilled artisan. Similarly, since the determined 
nucleotide sequence for the chitinase alpha-2 protein is provided in Figure S [SEQ 
ID NO: 11], generating polynucleotides which hybridize to a portion of the 
chitinase alpha-2 cDNA molecule would be routine to the skilled artisan. For 
example, restriction endonuclease cleavage or shearing by sonication of the 
chitinase alpha cDNA clone or chitinase alpha-2 cDNA clone could easily be used 
to generate DNA portions of various sizes which are polynucleotides that 
hybridize to a portion of the chitinase alpha or chitinase alpha-2 cDNA^moIecule. 
Alternatively, the hybridizing polynucleotides of the present invention could be 
generated synthetically according to known techniques. Of course, a 
polynucleotide which hybridizes only to a poly A sequence (such as the 3 ' terminal 
poly(A) tract of the chitinase alpha cDNA shown in Figure 1 [SEQ ID NO: 1]) or 
Figure 2 [SEQ ID N0;3], or to a complementary stretch of T (or U) resides, 
would not be included in a polynucleotide of the invention used to hybridize to a 
portion of a nucleic acid of the invention, since such a polynucleotide would 
hybridize to any nucleic acid molecule contain a poly (A) stretch or the 
complement thereof (e.g., practially any double-standed cDNA clone). 

As indicated, nucleic acid molecules of the present invention which encode 
the chitinase alpha or chitinase alpha-2 polypeptide may include, but are not 
limited to those encoding the amino acid sequence of the mature polypeptide, by 
itself; the coding sequence for the mature polypeptide and additional sequences, 
such as those encoding the about 26 amino acid leader or secretory sequence, such 
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as a pre-, r pro- or prepro- protein sequence; the coding sequence of the mature 
polypeptide, with or without the aforementioned additional coding sequences, 
together with additional, non-coding sequences, including for example, but not 
limited to introns and non-coding 5' and 3' sequences, such as the transcribed, 
non-translated sequences that play a role in transcription, mRNA processing - 
including splicing and polyadenylation signals, for example - ribosome binding and 
stability of mRNA; an additional coding sequence which codes for additional 
amino acids, such as those which provide additional functionalities. Thus, the 
sequence encoding the polypeptide may be fused to a marker sequence, such as 
a sequence encoding a peptide which facilitates purification of the fused 
polypeptide. In certain preferred embodiments of this aspect of the invention, the 
marker amino acid sequence is a hexa-histidine peptide, such as the tag provided 
in a pQE vector (Qiagen, Inc.), among others, many of which are commercially 
available. As described in Gentz et al. % Proc. Nail. Acad. ScL (USA) 55:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the 
fusion protein. The **HA n tag is another peptide useful for purification which 
corresponds to an epitope derived from the influenza hemagglutinin proteiiv/ which 
has been described by Wilson et aL 9 Celt 37:767 (1984). 

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or derivatives 
of the chitinase alpha or chitinase aIpha-2 protein. Variants may occur naturally, 
such as a natural allelic variant. By an "allelic variant" is intended one of several 
alternate forms of a gene occupying a given locus on a chromosome of an 
orgnanism. Genes II 9 Lewin, ed. Non-naturally occurring variants may be 
produced using art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, deletions 
or additions. The substitutions, deletions or additions may involve one or more 
nucleotides. The variants may be altered in coding or non-coding regions or both. 
Alterations in the coding regions may produce conservative or non-conservative 
amino acid substitutions, deletions or additions. Especially preferred among these 
are silent substitutions, additions and deletions, which do not alter the properties 
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thereof. Also especially preferred in this regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 90% 
5 identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical to 

(a) a nucleotide sequence encoding the full-length chitinase alpha polypeptide 
having the complete amino acid sequence in Figure 1 [SEQ ID NO:2] or Figure 
2 [SEQ ID NO:4] including the predicted leader sequence; (b) a nucleotide 
sequence encoding the mature chitinase alpha polypeptide (full-length polypeptide 

10 with the leader removed) having the amino acid sequence at positions 27-421 in 

Figure 1 [SEQ ED 1nTO:2] or Figure 2 [SEQ ID NO:4]; a nucleotide sequence 
encoding the full-length chitinase alpha polypeptide having the complete amino 
acid sequence including the leader encoded by the cDNA clone contained in 
ATCC Deposit No. 97529; (d) a nucleotide sequence encoding the mature 

1 5 chitinase alpha polypeptide having the amino add sequence encoded by the cDNA 

clone contained in ATCC Deposit No. 97529; (e) a nucleotide sequence encoding 
the chitinase alpha-2 polypeptide having the complete amino acid sequence shown 
in Figure 5 [SEQ ID NO: 12]; (f) a nucleotide sequence encoding the mature 
chitinase alpha-2 polypeptide having the amino acid sequence at positions 21-385 

20 in Figure 5 [SEQ ID NO: 12]; and (g) a nucleotide sequence complementary to any 

of the nucleotide sequences in (a), (b), (c), (d), (e), or (f). 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding an chitinase alpha or 
chitinase alpha-2 polypeptide is intended that the nucleotide sequence of the 

25 polynucleotide is identical to the reference sequence except that the polynucleotide 

sequence may include up to five point mutations per each 100 nucleotides of the 
reference nucleotide sequence encoding the chitinase alpha or chitinase alpha-2 
polypeptide. In other words, to obtain a polynucleotide having a nucleotide 
sequence at least 95% identical to a reference nucleotide sequence, up to 5% of 

30 the nucleotides in the reference sequence may be deleted or substituted with 

another nucleotide, or a number of nucleotides up to 5% of the total nucleotides 
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in the reference sequence may be inserted into the reference sequence. These 
mutations of the reference sequence may occur at the 5' or 3' terminal positions 
of the reference nucleotide sequence or anywhere between those terminal 
positions, interspersed either individually among nucleotides in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 90%, 95%, 97%, 98% or 99% identical to, for instance, the nucleotide 
sequence shown in Figure 1, Figure 2, or Figures or to the nucleotides sequence 
of the deposited cDNA clone can be determined conventionally using known 
computer programs such as the Bestfit program (Wisconsin Sequence Analysis 
Package, Version 8 for Unix, Genetics Computer Group, University Research 
Park, 575 Science Drive, Madison, WI 5371 1). Bestfit uses the local homology 
algorithm of Smith and Waterman {Advances in Applied Mathematics 2:482-489 
(1981)) to find the best segment of homology between two sequences. When 
using Bestfit or any other sequence alignment program to determine whether a 
particular sequence is, for instance, 95% identical to a reference sequence 
according to the present invention, the parameters are set, of course, such that the 
percentage of identity is calculated over the full length of the reference nucleotide 
sequence and that gaps in homology of up to 5% of the total number of 
nucleotides in the reference sequence are allowed. 

The present application is directed to nucleic acid molecules at least 90%, 
95%, 97%, 98% or 99% identical to the nucleic acid sequence shown in Figure 1 
[SEQ ID NO: 1] or Figure 2 [SEQ ID NO:3] or Figure 5 [SEQ ID NO: 1 1] or to 
the nucleic acid sequence of the deposited cDNA, irrespective of whether they 
encode a polypeptide having chitinase alpha or chitinase alpha-2 activity. This is 
because, even where a particular nucleic acid molecule does not encode a 
polypeptide having such activity, one of skill in the art would still know how to 
use the nucleic acid molecule, for instance, as a hybridization probe or a 
polymerase chain reaction (PCR) primer. Uses of the nucleic acid molecules of 
the present invention that do not encode a polypeptide having chitinase alpha or 
chitinase alpha-2 activity include, inter alia, (1) isolating the chitinase alpha or 
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chitinase alpha-2 gene or allelic variants thereof in a cDNA library; (2) in situ 
hybridizati n (e.g., "FISH") to metaphase chr mosomal spreads to pr vide 
precise chromosomal location of the chitinase alpha or chitinase alpha-2 gene as 
described in Verma et aL 9 Human Chromosomes: a Manual of Bessie Techniques, 
Pergamon Press, New York (1988); and Northern Blot analysis for detecting 
chitinase alpha or chitinase alpha-2 mRNA expression in specific cell types (e.g., 
placenta). 

Preferred, however, are nucleic acid molecules having sequences at least 
90%, 95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown 
in Figure 1 [SEQ ID NO:l] or to the nucleic acid sequence of the deposited 
cDNA which do, in feet, encode a polypeptide having chitinase alpha or chitinase 
alpha-2 protein activity. By "a polypeptide having chitinase alpha or chitinase 
alpha-2 activity" is intended polypeptides exhibiting activity similar, but not 
necessarily identical, to an activity of the chitinase alpha or chitinase alpha-2 
protein of the invention (either the full-length protein or, preferably, the mature 
protein) as measured in a particular biological assay. One such assay for chitinase 
activity is the assay described by Dickinson et aL 9 J. Gen. Microbiol i£5: 141 7- 
1421 (1989). 

Of course, due to the degeneracy of the genetic code, one of ordinary skill 
in the art will immediately recognize that a large number of the nucleic acid 
molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% 
identical to the nucleic acid sequence of the deposited cDNA or the nucleic acid 
sequence shown in Figure 1 [SEQ ID NO:l] or Figure 2 [SEQ ID NO:3] or 
Figure 5 [SEQ ID NO: 11] will encode a polypeptide "having chitinase alpha or 
chitinase alpha-2 protein activity." In fact, since degenerate variants of these 
nucleotide sequences all encode the same polypeptide, this will be clear to the 
skilled artisan even without performing one of the above-described comparison 
assays. It will be further recognized in the art that, for such nucleic acid molecules 
that are not degenerate variants, a reasonable number will also encode a 
polypeptide having chitinase alpha or chitinase alpha-2 protein activity. This is 
because the skilled artisan is fully aware of amino acid substitutions that are either 
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less likely or not likely to significantly effect protein function (e.g., replacing one 
aliphatic amino acid with a second aliphatic amino add). 

For example, guidance concerning how to make phenotypically silent 
amino acid substitions is provided in Bowie, J.U. et a/., Deciphering the Message 
in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 247:1306- 
1310 (1990), wherein the authors indicate that there are two main approaches for 
studying the tolerance of an amino acid sequence to change. The first method 
relies on the process of evolution, in which mutations are either accepted or 
rejected by natural selection. The second approach uses genetic engineering to 
introduce amino add changes at specific positions of a cloned gene and selections 
or screens to identify sequences that maintain functionality. As the authors state, 
these studies have revealed that proteins are surprisingly tolerant of amino acid 
substitutions. The authors further indicate which amino add changes are likely to 
be permissive at a certain position of the protein. For example, most buried amino 
acid residues require nonpolar side chains, whereas few features of surface side 
diains are generally conserved. Other such phenotypically silent substitutions are 
described in Bowie, J.U., et aL t supra, and the references cited therein. 

Thus, in accordance with another aspect of the invention, the following 
nucleotide sequence variations of the chitinase alpha gene in Figure 1 [SEQ ID 
NO:l] are provided: the nucleotides at positions 607-609 in Figure 1 [SEQ ID 
NO:l] are changed to either GAA or GAG. This results in an amino acid 
substitution of glutamic add for isoleucine. Figure 2 [SEQ ID NO:3] depicts this 
sequence variant, which is expected to encode a peptide having chitinolytic 
activity. 

Vectors and Host Cells 

The present invention also relates to vectors which indude the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of chitinase alpha 
polypeptides or portions thereof by recombinant techniques. 



23 

Recombinant constructs may be introduced into host cells using well 
known techniques such as infection, transduction, transfection, transvection, 
electroporation and transformation. The vector may be, for example, a phage, 
plasmid, viral or retroviral vector. Retroviral vectors may be replication 
competent or replication defective. In the latter case, viral propagation generally 
will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in a 
precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising cis-acting control regions to the 
polynucleotide of interest. Appropriate transacting factors may be supplied by 
the host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for 
specific expression, which may be inducible and/or cell type-specific. Particularly 
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, 
episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, 
bacteriophage, yeast eptsomes, yeast chromosomal elements, viruses such as 
baculovinises, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations 
thereof, such as cosmids and phagemids. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the R coli lac y trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
constructs will further contain sites for transcription initiation, tennination and, in 
the transcribed region, a ribosome binding ate for translation. The coding portion 
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of the mature transcripts expressed by the constructs will include a translation 
initiating AUG at the beginning and a termination codon appropriately positioned 
at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
5 selectable marker. Such markers include dihydrofolate reductase or neomycin 

resistance for eukaiyotic cell culture and tetracycline or ampicillin resistance genes 
for cuhuring in & coli and other bacteria. Representative examples of appropriate 
hosts include bacterial cells, such as K coli* Streptomyces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila 
10 S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes 

melanoma cells; and plant cells. Appropriate culture media and conditions for the 
above-described host 'cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and 
pQE-9, available from Qiagen; pBS vectors, Phagescript vectors, Bluescript 
15 vectors, pNH8A, pNH16a, pNH18A, pNH46A, available from Stratagene; and 

ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia 
Among preferred eukaiyotic vectors are pWLNEO, pSV2CAT, pOG44^>XTl 
and pSG available from Stratagene; pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia, and pA2 available from Qiagen. Other suitable vectors will be 
20 readily apparent to the skilled artisan. 

Among known bacterial promoters suitable for use in the present invention 
include the £L coli lad and lacL promoters, the T3 and T7 promoters, the gpt 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable 
eukaryotic promoters include the CMV immediate early promoter, the HSV 
25 thymidine kinase promoter, the early and late S V40 promoters, the promoters of 

retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and 
metallothionein promoters, such as the mouse metallothionein-I promoter. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid- 
30 mediated transfection, electroporation, transduction, infection or other methods. 



25 

Such methods are described in many standard laboratory manuals, such as Davis 
et aL, Basic Methods in Molecular Biology (1986). 

Transcription of the DNA encoding the polypeptides of the present 
invention by higher eukaryotes may be increased by inserting an enhancer 
sequence into the vector. Enhancers are cis-acting elements of DNA, usually 
about from 10 to 300 bp that act to increase transcriptional activity of a promoter 
in a given host cell-type. Examples of enhancers include the SV40 enhancer, 
which is located on the late side of the replication origin at bp 100 to 270, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side 
of the replication origin, and adenovirus enhancers. 

For secretion of the translated protein into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 
appropriate secretion signals may be incorporated into the expressed polypeptide. 
The signals may be endogenous to the polypeptide or they may be heterologous 
signals. 

Thus, the polypeptides of the invention may be expressed in a modified 
form, such as a fusion protein, and may include not only secretion signafc.but also 
additional heterologous functional regions. For instance, a region of additional 
amino acids, particularly charged amino acids, may be added to the N-terminus of 
the polypeptides to improve stability and persistence in the host cell, during 
purification or during subsequent handling and storage. Also, region(s) also may 
be added to the polypeptides to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques in 
the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize polypeptides. For example, EP-A-O 
464 533 (Canadian counterpart 2045869) discloses fusion proteins comprising 
various portions f constant region of immunoglobin molecules together with 
another human protein or part thereof. In many cases, the Fc part in fusion 
protein is thoroughly advantageous for use in therapy and diagnosis and thus 
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results, f r example, in improved pharmacokinetic properties (EP-A 0232 262). 
On the other hand, for some uses it would be desirable to be able to delete the Fc 
part after the fusion protein has been expressed, detected and purified in the 
advantageous manner described. This is the case when Fc portion proves to be 
a hindrance to use in therapy and diagnosis, for example when the fusion protein 
is to be used as antigen for immunizations. In drug discovery, for example, human 
proteins, such as, hIL5- has been fused with Fc portions for the purpose of high- 
throughput screening assays to identify antagonists of HL-5. See, Bennett, D. et 
al % Journal of Molecular Recognition 5:52-58 (1995) and Johanson, K 5 et aL. 
The Journal of Biological Chemistry 270(16):9459-9471 (1995). 

The chitinase alpha or chitinase alpha-2 protein can be recovered and 
purified from recombinant cell cultures by well-known methods including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high performance 
liquid chromatography ( tt HPLC*) is employed for purification. 

Polypeptides of the present invention include naturally purified products, 
products of chemical synthetic procedures, and products produced by recombinant 
techniques from a prokaryotic or eukaryotic host, including, for example, 
bacterial, yeast, higher plant, insect and mammalian cells. Depending upon the 
host employed in a recombinant production procedure, the polypeptides of the 
present invention may be glycosylated or may be non-glycosylated. In addition, 
polypeptides of the invention may also include an initial modified methionine 
residue, in some cases as a result of host-mediated processes. 

Chitinase Alpha and Chitinase Alpha-2 Polypeptides and Peptides 

The invention further provides an isolated chitinase alpha or chitinase 
alpha-2 polypeptide having the amino acid sequence encoded by the deposited 
cDNA, or the amino acid sequence in Figure 1 [SEQ ID NO:2], Figure 2 [SEQ 
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ID NO: 4], or Figure 5 [SEQ ID NO: 12] r a peptide or polypeptide comprising 
a portion of the above polypeptides. The terms "peptide** and "oligopeptide** are 
considered synonymous (as is commonly recognized) and each term can be used 
interchangeably as the context requires to indicate a chain of at least to amino 
acids coupled by peptidyl linkages. The word "polypeptide** is used herein for 
chains containing more than ten amino acid residues. All oligopeptide and 
polypeptide formulas or sequences herein are written from left to right and in the 
direction from amino terminus to carboxy terminus. 

It will be recognized in the art that some amino acid sequence of the 
chitinase alpha or chitinase alpha-2 polypeptide can be varied without significant 
effect of the structure or function of the protein. If such differences in sequence 
are contemplated, it should be remembered that there will be critical areas on the 
protein which determine activity. In general, it is possible to replace residues 
which form the tertiary structure, provided that residues performing a similar 
function are used. In other instances, the type of residue may be completely 
unimportant if the alteration occurs at a non-critical region of the protean. 

Thus, the invention further includes variations of the chitinase 'alpha and 
chitinase alpha-2 polypeptides which show substantial chitinase alpha or chitinase 
alpha-2 polypeptide activity. Such mutants include deletions, insertions, 
inversions, repeats, and type substitutions (for example, substituting one 
hydrophilic residue for another, but not strongly hydrophilic for strongly 
hydrophobic as a rule). Small changes or such "neutral*' amino acid substitutions 
will generally have little effect on activity. 

Typically seen as conservative substitutions are the replacements, one for 
another, among the aliphatic amino adds Ala, Val, Leu and lie; interchange of the 
hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, 
substitution between the amide residues Asn and Gin, exchange of the basic 
residues Lys and Arg and replacements among the aromatic residues Phe, Tyr. 

A particularly preferred variant of the chitinase alpha in Figure 1 [SEQ ID 
NO:2] are those which result in an amino acid substitution of glutamic acid for 
isoleudne at amino acid position 176 (i.e., the nucleotides at positions 607 to 609 
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in Figure 1 are changed to either GAA or GAG). Figure 2 [SEQ ID NO:4] 
depicts this preferred sequence variant, which is expected to encode a peptide 
having chitinolytic activity . 

As indicated in detail above, further guidance concerning which amino acid 
changes are likely to be phenotypically silent (i.e., are not likely to have a 
significant deleterious effect on a function) can be found in Bowie, J.U., et aL 9 
Science 2-/7.1306-1310 (1990). 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. A recombinantiy 
produced version of the chitinase alpha or chitinase alpha-2 polypeptide can be 
substantially purified by the one-step method described in Smith and Johnson, 
Gene 67:31-40 (1988). 

The polypeptides of the present invention include the polypeptide encoded 
by the deposited cDNA including the leader, the mature polypeptide encoded by 
the deposited the cDNA minus the leader (i.e., the mature protein), the 
polypeptide of Figure 1 [SEQ ID NO:2], Figure 2 [SEQ ID NO:4] or Figure 5 
[SEQ ID NO: 12] including the leader, the polypeptide of Figure 1 [SEQ ID 
NO:2], Figure 2 [SEQ ID NO:4] or Figure 5 [SEQ ID NO: 12] minus the leader, 
as well as polypeptides which have at least 90% similarity, more preferably at least 
95% similarity, and still more preferably at least 97%, 98% or 99% similarity to 
those described above. Further polypeptides of the present invention include 
polypeptides at least 80% identical, more preferably at least 90% or 95% identical, 
still more preferably at least 97%, 98% or 99% identical to the polypeptide 
encoded by the deposited cDNA, to the polypeptide of SEQ ID NO:2, SEQ ID 
NO:4, or SEQ ID NO: 12 and also include portions of such polypeptides with at 
least 30 amino acids and more preferably at least 50 amino acids. 

By w % similarity" for two polypeptides is intended a similarity score 
produced by comparing the amino acid sequences of the two polypeptides using 
the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Gr up, University Research Park, 575 Science Drive, 
Madison, WI 5371 1) and the default settings for determining similarity. Bestfit 
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uses the local homology algorithm of Smith and Waterman (Advances in Applied 
Mathematics 2:482-489 (1981)) to find the best segment of similarity between 
two sequences. 

By a polypeptide having an amino acid sequence at least, for example, 
5 95% "identical" to a reference amino acid sequence of a chitinase alpha or 

chitinase alpha-2 polypeptide is intended that the amino acid sequence of the 
polypeptide is identical to the reference sequence except that the polypeptide 
sequence may include up to five amino acid alterations per each 100 amino acids 
of the reference amino add of the chitinase alpha polypeptide. In other words, to 

10 obtain a polypeptide having an amino acid sequence at least 95% identical to a 

reference amino add sequence, up to 5% of the amino add residues in the 
reference sequence may be deleted or substituted with another amino acid, or a 
number of amino adds up to 5% of the total amino acid residues in the reference 
sequence may be inserted into the reference sequence. These alterations of the 

15 reference sequence may occur at the amino or carboxy terminal positions of the 

reference amino add sequence or anywhere between those terminal positions, 
interspersed dther individually among residues in the reference sequence or in one 
or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 

20 95%, 97%, 98% or 99% identical to, for instance, the amino add sequence shown 

in Figure 1 [SEQ ID NO:2], Figure 2 [SEQ ID NO:4) or Figure 5 [SEQ ID 
NO: 12], or to the amino add sequence encoded by deposited cDNA clone, can 
be determined conventionally using known computer programs such the Bestfit 
program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics 

25 Computer Group, University Research Park, 575 Science Drive, Madison, WI 

53711). When using Bestfit or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters are set, of 
course, such that the percentage of identity is calculated over the full length of the 

30 reference amino add sequence and that gaps in homology of up to 5°/ of the total 

number of amino add residues in the reference sequence are allowed. 
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As described in detail below, the polypeptides f the present invention can 
be used to raise polyclonal and monoclonal antibodies, which are useful in 
diagnostic assays for detecting chitinase alpha and/or chitinase alpha-2 protein 
expression as described below or as agonists and antagonists capable of enhancing 
or inhibiting protein function. Further, such polypeptides can be used in the yeast 
two-hybrid system to "capture" chitinase alpha or chitinase alpha-2 protein binding 
proteins which are also candidate agonist and antagonist according to the present 
invention. The yeast two hybrid system is described in Fields and Song, Nature 
340:245-246(1989). 

In another aspect, the invention provides a peptide or polypeptide 
comprising an epitope-bearing portion of a polypeptide of the invention. The 
epitope of this polypeptide portion is an immunogenic or antigenic epitope of a 
polypeptide of the invention. An "immunogenic epitope" is defined as a part of 
a protein that elicits an antibody response when the whole protein is the 
immunogen. These immunogenic epitopes are believed to be confined to a few 
loci on the molecule. On the other hand, a region of a protein molecule to which 
an antibody can bind is defined as an "antigenic epitope." The number of 
immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, for instance, Geysen, H.M. et aL, Proc. Noli Acad Set (USA) 
57:3998-4002 (1984). 

As to the selection of peptides or polypeptides bearing an antigenic epitope 
(i.e., that contain a region of a protein molecule to which an antibody can bind), 
it is well known in that art that relatively short synthetic peptides that mimic part 
of a protein sequence are routinely capable of eliciting an antiserum that reacts 
with the partially mimicked protein. See, for instance, Sutcliffe, J.G. et aL, 
Science 279:660-666 (1983). Peptides capable of eliciting protein-reactive sera 
are frequently represented in the primary sequence of a protein, can be 
characterized by a set of simple chemical rules, and are confined neither to 
immunodominant regions of intact proteins (Le. f immunogenic epitopes) nor to the 
amino or carboxyl terminals. Peptides that are extremely hydrophobic and those 
of six or fewer residues generally are ineffective at inducing antibodies that bind 
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to the mimicked protein; longer, soluble peptides, especially those containing 
proline residues, usually are effective. Sutcliffe et a/., supra, at 661 . For instance, 
18 of 20 peptides designed according to these guidelines, containing 8-39 residues 
covering 75% of the sequence of the influenza virus hemagglutinin HA1 
polypeptide chain, induced antibodies that reacted with the HAi protein or intact 
virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies 
glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 
specifically to a polypeptide of the invention. Thus, a high proportion of 
hybridomas obtained by fusion of spleen cells from donors immunized with an 
antigen epitope-bearing peptide generally secrete antibody reactive with the native 
protein. Sutcliffe et aL y supra, at 663. The antibodies raised by antigenic epitope- 
bearing peptides or polypeptides are useful to detect the mimicked protein, and 
antibodies to different peptides may be used for tracking the fate of various 
regions of a protein precursor which undergoes posttranslation processing. The 
peptides and anti-peptide antibodies may be used in a variety of qualitative or 
quantitative assays for the mimicked protein, for instance in competition assays 
since it has been shown that even short peptides (e.g., about 9 amino acids) can 
bind and displace the larger peptides in immunoprecipitation assays. See, for 
instance, Wilson, LA. et al y Cell 57:767-778 (1984) at 777. The anti-peptide 
antibodies of the invention also are useful for purification of the mimicked protein, 
for instance, by adsorption chromatography using methods well known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
designed according to the above guidelines preferably contain a sequence of at 
least seven, more preferably at least nine and most preferably between about 15 
to about 30 amino acids contained within the amino acid sequence of a 
polypeptide of the invention. However, peptides or polypeptides comprising a 
larger portion of an amino add sequence of a polypeptide of the invention, 
containing about 30 to about SO amino acids, or any length up to and including the 
entire amino acid sequence of a polypeptide of the invention, also are considered 
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ephope-bearing peptides or polypeptides of the invention and also are useful for 
inducing antibodies that react with the mimicked protein. Preferably, the amino 
acid sequence of the epitope-bearing peptide is selected to provide substantial 
solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic 
residues and highly hydrophobic sequences are preferably avoided); and sequences 
containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be 
used to generate chhinase alpha- or chitinase alpha-2-specific antibodies include: 
a polypeptide comprising amino acid residues from about 38 to about 51 in 
Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues from 
about 66 to about 85 in figure 1 (SEQ ID NO:2); a polypeptide comprising amino 
acid residues from about 117 to about 127 in Figure 1 (SEQ ID NO:2); a 
polypeptide comprising amino acid residues from about 145 to about 153 in 
Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues from 
about 165 to about 172 in Figure 1 (SEQ ID NO:2); a polypeptide comprising 
amino add residues from about 177 to about 185 in Figure 1 (SEQ ID NO:2); a 
polypeptide comprising amino acid residues from about 245 to about 254 in 
Figure 1 (SEQ ID NO: 2); a polypeptide comprising amino acid residues from 
about 256 to about 274 in Figure 1 (SEQ ID NO:2); a polypeptide comprising 
amino acid residues from about 286 to about 293 in Figure 1 (SEQ ID NO:2); a 
polypeptide comprising amino acid residues from about 366 to about 376 in 
Figure 1 (SEQ ID NO:2); and a polypeptide comprising amino acid residues from 
about 394 to about 407 in Figure 1 (SEQ ID NO:2). 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides 
including recombinant means using nucleic add molecules of the invention. For 
instance, a short epitope-bearing amino acid sequence may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and 
purification, as well as during immunization to produce anti-peptide antibodies. 
Epitope-bearing peptides also may be synthesized using known methods of 
chemical synthesis. For instance, Houghten has described a simple method for 
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synthesis of large numbers of peptides, such as 10-20 mg of 248 different 13 
residue peptides representing single amino acid variants of a segment of the HA I 
polypeptide which were prepared and characterized (by ELISA-type binding 
studies) in less than four weeks. Houghten, R.A., Proc. Natl Acad Set (USA) 
5 52:5131-5135 (1985). This "Simultaneous Multiple Peptide Synthesis (SMPS)" 

process is further described in U.S. Patent No. 4,631,211 to Houghten et al 
(1986). In this procedure the individual resins for the solid-phase synthesis of 
various peptides are contained in separate solvent-permeable packets, enabling the 
optimal use of the many identical repetitive steps involved in solid-phase methods. 

10 A completely manual procedure allows 500-1000 or more syntheses to be 

conducted simultaneously. Houghten ef ai, supra, at 5134. 

Epitope-bearing peptides and polypeptides of the invention are used to 
induce antibodies according to methods well known in the art. See, for instance, 
Sutcliffe et al y supra\ Wilson et al, supra, Chow, M etal, Proc. Natl Acad. ScL 

15 (USA) 52:910-914; and Bittle, FJ. et al, J. Gen. Virol 06:2347-2354 (1985). 

Generally, animals may be immunized with free peptide; however, anti-peptide 
antibody titer may be boosted by coupling of the peptide to a macromolecular 
carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, 
peptides containing cysteine may be coupled to carrier using a linker such as m- 

20 maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may 

be coupled to carrier using a more general linking agent such as glutaraldehyde. 
Animals such as rabbits, rats and mice are immunized with either free or carrier- 
coupled peptides, for instance, by intraperitoneal and/or intradermal injection of 
emulsions containing about 100 pg peptide or carrier protein and Freund's 

25 adjuvant. Several booster injections may be needed, for instance, at intervals of 

about two weeks, to provide a useful titer of anti-peptide antibody which can be 
detected, for example, by ELISA assay using free peptide adsorbed to a solid 
surface. The titer of anti-peptide antibodies in serum from an immunized animal 
may be increased by selection of anti-peptide antibodies, for instance, by 

30 adsorption to the peptide on a solid support and dution of the selected antibodies 

according to methods well known in the art. 



WO 98/06859 



PCT/US96/13003 



34 

Immun genie epitope-bearing peptides of the invention, i.e., those parts 
of a protein that elicit an antibody response when the whole protein is the 
immunogen, are identified according to methods known in the art. For instance, 
Geysen et a£, 1984, supra 9 discloses a procedure for rapid concurrent synthesis 

S on solid supports of hundreds of peptides of sufficient purity to react in an 

enzyme-linked immunosorbent assay. Interaction of synthesized peptides with 
antibodies is then easily detected without removing them from the support. In this 
manner a peptide bearing an immunogenic epitope of a desired protein may be 
identified routinely by one of ordinary skill in the art. For instance, the 

10 immunologically important epitope in the coat protein of foot-and-mouth disease 

virus was located by Geysen et ah with a resolution of seven amino acids by 
synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 
213 amino acid sequence of the protein. Then, a complete replacement set of 
peptides in which all 20 amino acids were substituted in turn at every position 

IS within the epitope were synthesized, and the particular amino adds conferring < 

specificity for the reaction with antibody were determined. Thus, peptide. analogs 
of the epitope-bearing peptides of the invention can be made routinely by this 
method. U.S. Patent No. 4,708,781 to Geysen (1987) further describes this 
method of identifying a peptide bearing an immunogenic epitope of a desired 

20 protein. 

Further still, U.S. Patent No. 5,194,392 to Geysen (1990) describes a 
general method of detecting or determining this sequence of monomers (amino 
acids or other compounds) which is a topological equivalent of the epitope (Le., 
a <l mimotope") which is complementary to a particular paratope (antigen binding 

25 site) of an antibody of interest. More generally, U.S. Patent No. 4,433,092 to 

Geysen (1989) describes a method of detecting or determining a sequence of 
monomers which is a topographical equivalent of a ligand which is complementary 
to the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent 
No. 5,480,971 to Houghten, R.A. et al (1996) on Peralkylated Oligopeptide 

30 Mixtures discloses linear Ci-CValkyl peralkylated oligopeptides and sets and 

libraries of such peptides, as well as methods for using such oligopeptide sets and 
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libraries for determining the sequence f a peralkyiated oligopeptide that 
preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs 
of the epitope-bearing peptides of the invention also can be made routinely by 
these methods. 

5 The entire disclosure of each document cited in this section on 

"Polypeptides and Peptides" is hereby incorporated herein by reference 

As one of skill in the art will appreciate, chitinase alpha and chitinase 
alpha-2 polypeptides of the present invention and the epitope-bearing fragments 
thereof described above can be combined with parts of the constant domain of 

10 immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 

facilitate purification and show an increased half-life in vivo. This has been 
shown, e.g., for chimeric proteins consisting of the first two domains of the human 
GD4-polypeptide and various domains of the constant regions of the heavy or light 
chains of mammalian immunoglobulins (EPA 394,827; Traunecker et aL, Nature 

15 357:84- 86 (1988)). Fusion proteins that have a disulfide-linked dimeric structure 

due to the IgG part can also be more efficient in binding and neutralizing other 
molecules than the monomelic protein or protein fragment alone (Fountoulakis et 
al. t JBiochem 270:3958-3964 (1995)). 

Chromosome Assays 

20 The nucleic acid molecules of the present invention are also valuable for 

chromosome identification. The sequence is specifically targeted to and can 
hybridize with a particular location on an individual human chromosome. 
Moreover, there is a current need for identifying particular sites on the 
chromosome. Few chromosome marking reagents based on actual sequence data 

25 (repeat polymorphisms) are presently available for marking chromosomal location. 

The mapping of DNAs to chromosomes according to the present invention is an 
important first step in correlating those sequences with genes associated with 
disease. 
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In certain preferred embodiments in this regard, the cDNA herein disclosed 
is used to clone genomic DNA of a chitinase alpha protein gene. This can be 
accomplished using a variety of well known techniques and libraries, which 
generally are available commercially. The genomic DNA then is used for in situ 
chromosome mapping using well known techniques for this purpose. Typically, 
in accordance with routine procedures for chromosome mapping, some trial and 
error may be necessary to identify a genomic probe that gives a good in situ 
hybridization signal. 

In some cases, in addition, sequences can be mapped to chromosomes by 
preparing PGR primers (preferably 15-25 bp) from the cDNA. Computer analysis 
of the 3' untranslated region of the gene is used to rapidly select primers that do 
not span more than one exon in the genomic DNA, thus complicating the 
amplification process. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those hybrids 
containing the human gene corresponding to the primer will yield an amplified 
portion. ( 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular DNA to a particular chromosome. Using the present invention with the 
same oligonucleotide primers, sublocalization can be achieved with panels of 
portions from specific chromosomes or pools of large genomic clones in an 
analogous manna*. Other mapping strategies that can similarly be used to map to 
its chromosome include in situ hybridization, prescreening with labeled flow- 
sorted chromosomes and preselection by hybridization to construct chromosome 
specific-cDNA libraries. 

Fluorescence in situ hybridization ("FISH") of a cDNA clone to a 
metaphase chromosomal spread can be used to provide a precise chromosomal 
location in one step. This technique can be used with probes from the cDNA as 
short as 50 or 60 bp. For a review of this technique, see Verma et a/., Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York 
(1988). 
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Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for example, in V. McKusick, Mendelian 
Inheritance in Man, available on-line through Johns Hopkins University, Welch 
Medical Library. The relationship between genes and diseases that have been 
mapped to the same chromosomal region are then identified through linkage 
analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDNA or genomic 
sequence between affected and unaffected individuals. If a mutation is observed 
in some or all of the affected individuals but not in any normal individuals, then the 
mutation is likely to be the causative agent of the disease. 

With current resolution of physical mapping and genetic mapping 
techniques, a cDNA precisely localized to a chromosomal region associated with 
the disease could be one ofbetween 50 and 500 potential causative genes. (This 
assumes 1 megabase mapping resolution and one gene per 20 kb). 

Identification of Receptor for the Chitinase Alpha or ChitinashAlpha-2 
Ligand 

This invention further provides a method for identification of the receptor 
for the chitinase alpha or chitinase alpha-2 ligand. The gene encoding the receptor 
can be identified by numerous methods known to those of skill in the art, for 
example, ligand panning and FACS sorting (Coligan, et al % Current Protocols in 
Immun. 7(2), Chapter 5, (1991)). Preferably, expression cloning is employed 
wherein polyadenylated RNA is prepared from a cell responsive to the chitinase 
alpha or chitinase alpha-2 ligand, and a cDNA library created from this RNA is 
divided into pools and used to transfect COS cells or other cells that are not 
responsive to the chitinase alpha or chitinase alpha-2 ligand. Transfected cells 
which are grown on glass slides are exposed to labeled chitinase alpha or chitinase 
alpha-2 ligand, which can be labeled by a variety of means including iodination or 
inclusion of a recognition site for a she-specific protein kinase. Following fixation 
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and incubation, the slides are subjected to autoradiographic analysis. Positive 
pools are identified and sub-pools are prepared and re-transfected using an 
iterative sub-pooling and re-screening process, eventually yielding a single clone 
that encodes the putative receptor. 

As an alternative approach for receptor identification, labeled ligand can 
be photoaffinity linked with cell membrane or extract preparations that express the 
receptor molecule. Cross-linked material is resolved by PAGE and exposed to X- 
ray film. The labeled complex containing the ligand-receptor can be excised, 
resolved into peptide fragments, and subjected to protein microsequencing. The 
amino acid sequence obtained from microsequencing would be used to design a 
set of degenerate oligonucleotide probes to screen a cDNA library to identify the 
gene encoding the putative receptor. 

The method for determining whether a ligand can bind to the chitinase 
alpha or chitinase alpha-2 receptor comprises transfecting a cell population (one 
presumed not to contain the receptor) with the appropriate vector expressing the 
chitinase alpha or chitinase alpha-2 receptor, such that the cell will now express 
the receptor. A suitable response system is obtained by transfection of the DNA 
into a suitable host containing the desired second messenger pathways including 
cAMP, ion channels, phosphoinositide kinase, or calcium response. Such a 
transfection system provides a response system to analyze the activity of various 
ligands exposed to the cell. Ligands chosen could be identified through a rational 
approach by taking known ligands that interact with similar types of receptors or 
using small molecules, cell supernatants or extracts or natural products. 

The present invention further provides a method of screening drugs to 
identify those which block (antagonists) interaction of ligand to receptor. A 
chitinase alpha or chitinase alpha-2 antagonist is a compound which eliminates or 
reduces the natural biological functions of chitinase alpha or chitinase alpha-2. As 
an example, a mammalian cell or membrane preparation expressing the chitinase 
alpha or chitinase alpha-2 receptor would be incubated with labeled ligand in the 
presence of the drug. The ability of the drug to block this interaction could then 
be measured. Alternatively, the response of a known second messenger system 
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following interaction of ligand and receptor would be measured and compared in 
the presence or absence of the drug. Such second messenger systems include but 
are not limited to, cAMP guanylate cyclase, ion channels or phosphoinositide 
hydrolysis. 

Potential antagonists include an antibody, or in some cases, an 
oligopeptide, which binds to the polypeptide. Alternatively, a potential antagonist 
may be a closely related protein which binds to the receptor sites, however, they 
are inactive forms of the polypeptide and thereby prevent the action of chitinase 
alpha or chitinase aIpha-2 since receptor sites are occupied. 

Chitinase alpha or chitinase alpha-2 can also be used in the production of 
antibodies. The polypeptides, their fragments or other derivatives, or analogs 
thereof, or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or 
monoclonal antibodies. The present invention also includes chimeric, single chain, 
and humanized antibodies, as well as Fab fragments, or the product of an Fab 
expression library. Various procedures known in the art may be used for the 
production of such antibodies and fragments. 

Antibodies generated against the polypeptides corresponding to a sequence 
of the present invention can be obtained by direct injection of the polypeptides into 
an animal or by administering the polypeptides to an animal, preferably a 
nonhuman. The antibody so obtained wiD then bind the polypeptides itself. In this 
manner, even a sequence encoding only a fragment of the polypeptides can be 
used to generate antibodies binding the whole native polypeptides. Such 
antibodies can then be used to isolate the polypeptide from tissue expressing that 
polypeptide. 

For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples 
include the hybridoma technique (Kohler and Milstein, Nature 256:495-497 
(1975)), the trioma technique, the human B-cell hybridoma technique (Kozbor et 
al. 9 Immunology Today 4:12 (1983)), and the EBV-hybridoma technique to 
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produce human monoclonal antibodies (Colee/ aL , in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc. (1985), pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to 
immunogenic chitinase alpha or chitinase alpha-2 products of this invention. Also, 
transgenic mice may be used to express humanized antibodies to immunogenic 
chitinase alpha or chitinase alpha-2 products of this invention. 

Another potential antagonist is an anti sense construct prepared using 
antisense technology. Antisense technology can be used to control gene 
expression through triple-helix formation or antisense DNA or RNA, both of 
which methods are based on binding of a polynucleotide to DNA or RNA. For 
example, the 5* coding portion of the polynucleotide sequence, which encodes for 
the mature polypeptides of the present invention, is used to design an antisense 
RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA 
oligonucleotide is designed to be complementary to a region of the gene involved 
in transcription (triple helix-see Lee et aL 9 NucL Acids Res. 6:3073 (1979); 
Cooney.e/ a/., Science 247:456 (1988); and Dervan et aL % Science 257:1360 
(1991)), thereby preventing transcription and the production of chitinase alpha. 
The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks 
translation of the mRNA molecule into chitinase alpha or chitincase alpha-2 
polypeptide (Antisense - Okano, J. Neurochem. 56:560 (1991); 
Oligodeaxyrmcleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). The oligonucleotides described above can also be 
delivered to cells such that the antisense RNA or DNA may be expressed in vivo 
to inhibit production of chitinase alpha or chitinase alpha-2. 

Potential antagonists also include a small molecule which binds to and 
occupies the catalytic she of the polypeptide thereby making the catalytic site 
inaccessible to substrate such that normal biological activity is prevented. 
Examples of small molecules include, but are not limited to, small peptides or 
peptide-like molecules. 
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Therapeutic Applications of Ckitinase Alpha, Chitinase Alpha-2 and 
Antagonists Thereof 

Treatment of pathological conditions characterized by elevated 
plasma levels of chitinase alpha or chitinase alpha-2 

The antagonists, described above, may be employed to treat rheumatoid 
arthritis, atherosclerosis, inflammation, or other tissue remodeling diseas 
processes, that may be characterized by elevated plasma levels of chitinase alpha 
and/or chitinase alpha-2. The antagonists may be employed in a composition with 
a pharmaceutically acceptable carrier, e.g., as hereinafter described. 

It is believed that human chitotriosidase is involved in tissue remodeling. 
Human chitotriosidase is homologous to microbial chitinases. In particular, the 
protein shares a similar cysteine motif and homology to the active site of microbial 
chitinases. Accordingly, chitotriosidase can be used in the development of 
treatments of tissue remodeling diseases and in the diagnosis of these diseases. 
Similarly, human chitinase alpha or chitinase alpha-2 can be used the same way. 

It is believed that human serum chitotriosidase is an extracellular matrix 
remodeling factor involved in the remodeling process occurring in diseases. 
Activity against chitotriosidase substrates is indicative of a function in degrading 
extracellular matrix substrates with a similar carbohydrate structure to chitin. 
Chitotriosidase was found during morphological differentiation of monocytes into 
macrophages. These macrophages can be associated with different sites of 
inflammation that occur in various disease states, including but not limited to 
atherosclerosis, rheumatoid arthritis, acute inflammatory disease, and inflammation 
due to injury. 

Accordingly, it is believed that human serum chitotriosidase, and now 
chitinase alpha and chitinase alpha-2, are useful targets for the inhibition of tissue 
remodeling disease processes. Purified human chitinase alpha or chitinase alpha-2 
protein could thus be tested for various in vitro assays for tissue remodeling to 
determine the extent to which this protein promotes these processes. Therapeutics 
would be defined as molecules that specifically inhibit chitinase alpha or chitinase 
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alpha-2 function as it relates to tissue remodeling activity in these assays and 
includes neutralizing antibodies as described above. The invention also relates to 
methods of treating a patient with a tissue remodeling disorder by administering 
a therapeutically effective amount of a chitinase alpha or chitinase alpha-2 
antagonist or antibody. 

Treatment of fungal infection 

In addition, the invention relates to methods of treating fungal infection by 
administering a therapeutically effective amount of chitinase alpha or chitinase 
alpha-2 to a patient. Chitinases expressed in human tissues are expected to confer 
protection against fungi since chitin is a component of the fungal cell wall, and 
chitinase cleaves this polymer. Preferably, the chitinase alpha variant having an 
amino acid sequence shown in Figure 2 (SEQ ID NO: 4) is used for treating 
fungal infection. 

Modes of administration 

Given the activities modulated by chitinase alpha and chitinase alpha-2, it 
is readily apparent that a substantially altered (increased or decreased) level of 
expression of these proteins in an individual compared to the standard or "normal" 
level produces pathological conditions such as those described above. It will also 
be appreciated by one of ordinary skill that, since the chitinase alpha and chitinase 
alpha-2 proteins of the invention are translated with a leader peptide suitable for 
secretion of the mature protein from cells, when chitinase alpha or chitinase alpha- 
2 protein (particularly the mature form) is added from an exogenous source to 
cells, tissues or the body of an individual, the protein will exert its modulating 
activities on any of its target cells. Therefore, it will be appreciated that conditions 
caused by a decrease in the standard or normal level of chitinase alpha or chitinase 
alpha-2 activity in an individual, can be treated by administration of these proteins. 
Thus, the invention further provides a method of treating an individual in need of 
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an increased level of chitinase alpha or chitinase alpha-2 activity comprising 
administering t such an individual a pharmaceutical composition comprising an 
amount of an isolated chitinase alpha and/or chitinase alpha-2 polypeptide of the 
invention, particularly the mature form, effective to increase the chitinase alpha 
and/or chitinase alpha-2 activity level in such an individual. 

The chitninase alpha and/or chitinase polypeptide composition to be used 
in the therapy will be formulated and dosed in a fashion consistent with good 
medical practice, taking into account the clinical condition of the individual patient 
(especially the side effects of treatment with chitinase alpha polypeptide alone), 
the site of delivery of the composition, the method of administration, the 
scheduling of administration, and other factors known to practitioners. The 
"effective amount" of chitinase alpha or chitinase alpha-2 polypeptide for purposes 
herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutically effective amount of 
chitinase alpha or chitinase alpha-2 polypeptide administered parenterally per dose 
will be in the range of about 1 ng/kg/day to 10 mg/kg/day of patient body weight, 
although, as noted above, this will be subject to therapeutic discretion. More 
preferably, this dose is at least 0.01 mg/kg/day, and most preferably for humans 
between about 0.01 and 1 mg/kg/day for the hormone. If given continuously, the 
chitinase alpha or chitinase alpha-2 polypeptide is typically administered at a dose 
rate of about 1 fig/kg/hour to about 50 ^ig/kg/hour, either by 1-4 injections per 
day or by continuous subcutaneous infusions, for example, using a mini-pump. An 
intravenous bag solution may also be employed. 

The chitinase alpha or chitinase alpha-2 polypeptide is also suitably 
administered by sustained-release systems. Suitable examples of sustained-release 
compositions include semi-permeable polymer matrices in the form of shaped 
articles, e.g., films, or mirocapsules. Sustained-release matrices include 
polyiactides (U.S. Pat No. 3,773,919, EP 58,481X copolymers of L-glutamic acid 
and gamma-ethyl-L-glutamate (Sidman, U. et aL t Biopotymers 22:547-356 
(1983)), poly (2- hydroxyethyl methacrylate) (R. Langer et aL, J. BiomecL Mater. 
Res. 75:167-277 (1981), and R. Langer, Chem. Tech 72:98-105 (1982)), ethylene 
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vinyl acetate QL Langer et aL t Id} or poIy-D- (-)-3-hydroxybutyric acid (EP 
133,988). Sustained-release chitinase alpha or chitinase alpha-2 polypeptide 
compositions also include liposomally entrapped polypeptides. Liposomes 
containing chitinase alpha or chitinase aIpha-2 polypeptide are prepared by 
methods known per se: DE 3,218,121; Epstein et aL, Proc. Natl. Acad ScL 
(USA) 52:3688-3692 (1985); Hwang et al p Proc. NatL Acad ScL (USA) 
77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 
142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos. 4,485,045 and 
4^44^545; and EP 102,324. Ordinarily, the liposomes are of the small (about 
200-800 Angstroms) unilamellar type in which the lipid content is greater than 
about 30 mol. percent cholesterol, the selected proportion being adjusted for the 
optimal chitinase alpha or chitinase alpha-2 polypeptide therapy. 

For parenteral administration, in one embodiment, the chitinase alpha or 
chitinase alpha-2 polypeptide is formulated generally by mixing it at the desired 
degree of purity, in a unit dosage injectable form (solution, suspension, or 
emulsion), with a pharmaceutical^ acceptable carrier, i.e., one that is non-toxic 
to recipients at the dosages and concentrations employed and is compatible with 
other ingredients of the formulation. For example, the formulation preferably does 
not include oxidizing agents and other compounds that are known to be 
deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then, if necessary, the product is shaped into the desired formulation. Preferably 
the carrier is a parenteral carrier, more preferably a solution that is isotonic with 
the blood of the recipient. Examples of such carrier vehicles include water, saline, 
Ringer's solution, and dextrose solution. Non-aqueous vehicles such as fixed oils 
and ethyl oleate are also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials are non- 
toxic to recipients at the dosages and concentrations employed, and include 
buffo-s such as phosphate, citrate, succinate, acetic acid, and other organic acids 
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or their salts; antioxidants such as ascorbic acid; low molecular weight (less than 
about ten residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such 
as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, 
or arginine; monosaccharides, disaccharides, and other carbohydrates including 
cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such as 
EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; 
and/or nonionic surfactants such as polysorbates, poloxamers, or PEG. 

The chitinase alpha and chitinase alpha-2 polypeptides are typically 
formulated in such vehicles at a concentration of about 0. 1 mg/ml to 100 mg/ml, 
preferably 1-10 mg/ml, at a pH of about 3 to 8. It will be understood that the use 
of certain of the foregoing excipients, carriers, or stabilizers will result in the 
formation of polypeptide salts. 

Polypeptides to be used for therapeutic administration must be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes 
(e.g., 0.2 micron membranes). Therapeutic chitinase alpha and chitianse aIpha-2 
polypeptide compositions generally are placed into a container having* sterile 
access port, for example, an intravenous solution bag or vial having a stopper 
pierceable by a hypodermic injection needle. 

Chitinase alpha and chitinase alpha-2 polypeptides ordinarily will be stored 
in unit or multi-dose containers, for example, sealed ampoules or vials, as an 
aqueous solution or as a lyophilized formulation for reconstitution. As an example 
of a lyophilized formulation, 10-ml vials are filled with 5 ml of sterile-filtered 1% 
(w/v) aqueous chitinase alpha polypeptide solution, and the resulting mixture is 
lyophilized. The infusion solution is prepared by reconstituting the lyophilized 
chitinase alpha polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one 
or more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such containers) can be a notice 
in the form prescribed by a governmental agency regulating the manufacture, use 
or sale of pharmaceuticals or biological products, which notice reflects approval 
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by the agency of manufacture, use or sale for human administration. In addition, 
the polypeptides of the present invention may be employed in conjunction with 
other therapeutic compounds. 

Example 1 

Bacterial Expression and Purification of Chitinase Alpha 

The DNA sequence encoding chitinase alpha, ATCC No. 97529, is initially 
amplified using PCR oligonucleotide primers corresponding to the 5' sequences 
of the processed chitinase alpha protein (minus the signal peptide sequence) and 
the vector sequences 3' to the gene. Additional nucleotides containing restriction 
sites to facilitate cloning are added to the 5' and 3' sequences respectively. 

The 5 1 oligonucleotide primer has the sequence 
CGCGGATCCATGGGAGCAACCACCATGGACCAG (SEQ ID NO: 5) and 
contains a BamHI restriction site followed by 24 nucleotides of chitinase alpha 
coding sequence starting from the presumed terminal amino acid of the processed 
protein codon. 

The 3' primer sequence CGCGGATCCGCTTCTCTGTAAGTTAATCCT 
(SEQ ID NO:6) contains complementary sequences to the BamHI site and is 
followed by 21 nucleotides of chitinase-alpha DNA insert. The restriction sites 
correspond to the restriction enzyme sites on the bacterial expression vector pQE- 
9. (Qiagen, Inc. 9259 Eton Avenue, Chatsworth, CA, 9131 1). pQE-9 encodes 
antibiotic resistance (Amp 1 ), a bacterial origin of replication (ori), an IPTG- 
regulatable promoter operator (P/O), a ribosome binding site (RBS), a 6-His tag 
and restriction enzyme sites. pQE-9 is then digested with BamHI. 

The amplified sequences are ligated into pQE-9 and inserted in frame with 
the sequence encoding for the histidine tag and the RBS. The ligation mixture is 
then used to transform the K coli strain available from Qiagen under the 
trademark MlS/rep 4 by the procedure described in Sambrook, J. et al. 9 
Molecular Cloning: A Laboratory Manual, Cold Spring Laboratory Press, 
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(1989). M15/rep4 contains multiple copies of the plasmid ;pREP4, which 
expresses the iacl repressor and also confers kanamycin resistance (Kan*). 
Transformants are identified by their ability to grow on LB plates and 
ampicillin/kanamycin resistant colonies were selected. Plasmid DNA is isolated 
and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in 
liquid culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 
fig/ml). The O/N culture is used to inoculate a large culture at a ratio of 1: 100 to 
1 :250. The cells were grown to an optical density 600 (O.D.* 00 ) of between 0.4 
and 0.6. IPTG ("Isopropyl-B-D-thiogalacto pyranoside") is then added to a final 
concentration of 1 mM IPTG induces by inactivating the laci repressor, clearing 
the P/O leading to increased gene expression. Cells are grown an extra 3 to 4 
hours. Cells were then harvested by centrifugation. The cell pellet is solubilized 
in the chaotropic agent 6 Molar Guanidine HCI. After clarification, solubilized 
chitinase alpha is purified from this solution by chromatography on a Nickel- 
Chelate column under conditions that allow for tight binding by proteins 
containing the 6-His tag (Hochuli, E. et aL, J. Chromatography 4Jftl77-l$4 
(1984)). Chitinase-alpha (>90% pure) is eluted from the column in 6 molar 
guanidine HCI pH S.O and for the purpose of renaturation adjusted to 3 molar 
guanidine HCI, lOOmM sodium phosphate, 10 mmolar glutathione (reduced) and 
2 mmolar glutathione (oxidized). After incubation in this solution for 12 hours, 
the protein is dialyzed to 10 mmolar sodium phosphate. 

Example 2 

Cloning and Expression of Chitinase Alpha in Baculovirus ~- 

The DNA sequence encoding the full length chitinase alpha protein, ATCC 
No. 97529, is amplified using PCR oligonucleotide primers corresponding to the 
5' and 3* sequences f the gene: 

The 5' primer has the sequence 
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CGCGGATCCGCCATCATGGGAGCAACCACCATG (SEQ ID NO: 7) and 
contains a BamHI restriction site followed by 6 nucleotides resembling an 
efficient signal for the initiation of translation in eukaryotic cells (Kozak, M., J. 
MoL BioL 796:947-950 (1987) which is just behind the first 280 nucleotides of the 
chitinase-alpha gene (the initiation codon for translation W ATG W is underlined). 

The 3' primer has the sequence 
CGCGGATCCGCTTCTCTGTAAGTTAATCCT (SEQ ID NO: 8) and contains 
the cleavage site for the restriction endonuclease BamHI and 21 nucleotides 
complementary to the 3' non-translated sequence of the chitinase-alpha gene. The 
amplified sequences are isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Joila, Ca.)- The fragment is then 
digested with the endonuclease BamHI, and then purified again on a 1% agarose 
gel. This fragment is designated F2. 

The vector pA2 (modification of pVL941 vector, discussed below) is used 
for the expression of the chitinase alpha protein using the baculovirus expression 
system (for review see: Summers, M.D. and Smith, G.E. A manual of methods for 
baculovirus vectors and insect cell culture procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555 (1987)). This expression vector contains 
the strong polyhedrin promoter of the Autographa califomica nuclear polyhedrosis 
virus (AcMNPV) followed by the recognition sites for the restriction endonuclease 
BamHI. The polyadenylation site of the simian virus (SV)40 is used for efficient 
polyadenylation. For an easy selection of recombinant virus th6 beta-galactosidase 
gene from E.colt is inserted in the same orientation as the polyhedrin promoter 
followed by the polyadenylation signal of the polyhedrin gene. The polyhedrin 
sequences are flanked at both sides by viral sequences for the cell-mediated 
homologous recombination of co-transfected wild-type viral DNA Many other 
baculovirus vectors could be used in place of pRGl such as pAc373, pVL941 and 
pAcIMl (Luckow, V.A. and Summers, M.D., Virology 770:31-39 ). 

The plasmid is digested with the restriction enzyme BamHI and then 
dephosphorylated using calf intestinal phosphatase by procedures known in the 
art. The DNA is then isolated from al/o agarose gel using the commercially 
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available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). This vector DNA is 
designated V2. 

Fragment F2 and the dephosphorylated plasmid V2 is ligated with T4 
DNA ligase. Kcoli HB101 cells were then transformed and bacteria identified 
that contained the plasmid (pBac chitinase alpha) with the chitinase-alpha gene 
using the enzyme BamHL The sequence of the cloned fragment is confirmed by 
DNA sequencing. 

Five /ig of the plasmid pBacchitinase alpha is co-transfected with 1 .0 ti% 
of a commercially available linearized baculovirus ("BacuIoGoId™ baculovims 
DNA H , Phaimingen, San Diego, CA) using the lipofection method (Feigner et aL y 
Proc. Natl Acad. ScU (USA) W:7413-7417 (1987)). 

One/ig of BacuIoGoId™ virus DNA and 5 ptg of the plasmid pBac 
chitinase alpha are mixed in a sterile well of a microtiter plate containing 50 /zl of 
serum free Grace's medium (Life Technologies Inc., Gahhersburg, MD). 
Afterwards 10 pel Lipofectin plus 90 /zl Grace's medium are added, mixed and 
incubated for 15 minutes at room temperature. Then the transfection mixture is 
added drop- wise to the Sfl? insect cells (ATCC CRL 1711) seeded in% 35 mm 
tissue culture plate with 1 ml Grace's medium without serum. The plate is rocked 
back and forth to mix the newly added solution. The plate is then incubated for 
5 hours at 27°C. After 5 hours, the transfection solution is removed from the 
plate and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum 
is added. The plate is put back into an incubator and cultivation continued at 
27°C for four days. 

After four days, the supernatant is collected and a plaque assay is 
performed similar to the assay described by Summers and Smith (supra). As a 
modification, an agarose gel with "Blue Gal" (Life Technologies Inc., 
Gahhersburg) is used, which allows an easy isolation of blue stained plaques. (A 
detailed description of a "plaque assay" can also be found in the user's guide for 
insect ceil culture and baculovirology distributed by Life Technologies Inc., 
Gahhersburg, page 9-10). 
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Four days after the serial dilution, the virus is added to the cells, and blue 
stained plaques are picked with the tip of an Eppendorf pipette. The agar 
containing the recombinant viruses is then resuspended in an Eppendorf tube 
containing 200 /zl of Grace's medium. The agar is removed by a brief 
centrifugation and the supernatant containing the recombinant baculovirus was 
used to infect Sf9 cells seeded in 35 mm dishes. Four days later the supernatant? 
of these culture dishes are harvested and then stored at 4°C. 

Sf9 cells were grown in Grace's medium supplemented with 10% heat- 
inactivated FBS. The cells were infected with the recombinant baculovirus V- 
chitinase-alpha at a multiplicity of infection (MOI) of 2. Six hours later, the 
medium is removed and replaced with SF900 II medium minus methionine and 
cysteine (Life Technologies Inc., Gaithersburg). 42 hours later 5 /xCi of ?5 S- 
methionine and 5 jzCi 55 S cysteine (Amersham) are added. The cells are further 
incubated for 16 hours before they are harvested by centrifugation and the labelled 
proteins visualized by SDS-P AGE and autoradiography. 

Example 3 

Cloning and Expression in Mammalian Cells 

Most of the vectors used for the transient expression of the chitinase alpha 
protein gene sequence in mammalian cells should cany the SV40 origin of 
replication. This allows the replication of the vector to high copy numbers in cells 
(e.g. COS cells) which express the T antigen required for the initiation of viral 
DNA synthesis. Any other mammalian cell line can also be utilized for this 
purpose. 

A typical mammalian expression vector contains the promoter element, 
which mediates the initiation of transcription of mRNA, the protein coding 
sequence, and signals required for the termination of trancription and 
polyadenyiation of the transcript. Additional elements include enhancers, Kozak 
sequences and intervening sequences flanked by donor and acceptor sites for RNA 
splicing. Highly efficient transcription can be achieved with the early and late 



51 

promoters from SV40, the long terminal repeats (LTRs) from Retroviruses, e.g. 
RSV, HTLVI, fflVI and the early promoter of the cytomegalovirus (CMV). 
However, cellular signals can also be used (e.g. human actin promoter). Suitable 
expression vectors for use in practicing the present invention include, for example, 
vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), pRSVcat 
(ATCC 37152), pSV2dhfr (ATCC 37146) and pBC12MI (ATCC 67109). 
Mammalian host cells that could be used include, human Hela, 283, H9 and 
Jurkart cells, mouse NIH3T3 and CI 27 cells, Cos 1, Cos 7 and C VI, African 
green monkey cells, quail QC1-3 cells, mouse L cells and Chinese hamster ovary 
cells. 

Alternatively, the gene can be expressed in stable cell lines that contain the 
gene integrated into a chromosome. The co-transfection with a selectable marker 
such as dhfr, gpt, neomycin, hygromycih allows the identification and isolation of 
the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) is a useful marker to 
develop cell lines that cany several hundred or even several thousand copies of the 
gene of interest. Another useful selection marker is the enzyme glutamine synthase 
(GS) (Murphy et al, Biochem J. 227:277-279 (1991); Bebbington et al, 
Bio/Technology 70:169-175 (1992)). Using these markers, the mammalian cells 
are grown in selective medium and the cells with the highest resistance are 
selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) cells are often used for the 
production of proteins. 

The expression vectors pCl and pC4 contain the strong promoter (LTR) 
of the Rous Sarcoma Virus (Cullen et al. 9 Molecular and Cellular Biology, 
438-4470 (March, 1985)) plus a fragment of the CMV-enhancer (Boshart et al, 
CelI4J:52US30 (1985)). Multiple cloning sites, e.g. with the restriction enzyme 
cleavage sites BamHI, Xbal and Asp718, facilitate the cloning of the gene of 
interest. The vectors contain in addition the 3' intron, the polyadenylation and 
termination signal of the rat preproinsulin gene. 
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Example 3(a) Expression in COS Cells 

The expression plasmid, chitinase alpha HA, is made by cloning a cDNA 
encoding chitinase alpha into the expression vector pcDNAI/Amp (which can be 
obtained from Invitrogen, Inc.). 

The expression vector pcDNAI/amp contains: (1) an Kcoli origin of 
replication effective for propagation in K coli and other prokaryotic cells; (2) an 
ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; (3) 
an SV40 origin of replication for propagation in eukaryotic cells; (4) a CMV 
promoter, a polylinker, an SV40 intron, and a polyadenylation signal arranged so 
that a cDNA conveniently can be placed under expression control of the CMV 
promoter and operabiy linked to the SV40 intron and the polyadenylation signal 
by means of restriction sites in the polylinker. 

A DNA fragment encoding the entire chitinase alpha precursor and an HA 
tag fused in frame to its 3' end is cloned into the polylinker region of the vector so 
that recombinant protein expression is directed by the CMV promoter. The HA 
tag corresponds to an epitope derived from the influenza hemagglutinin protein 
described by Wilson et ai y Cell 37:161 (1984). The fusion of the HA tag to the 
target protein allows easy detection of the recombinant protein with an antibody 
that recognizes the HA epitope. 

The plasmid construction strategy is as follows. 

The chitinase alpha cDNA of the deposited clone is amplified using primers 
that contain convenient restriction sites, much as described above regarding the 
construction of expression vectors for expression of chitinase alpha in K coli. To 
facilitate detection, purification and characterization of the expressed chitinase 
alpha, one of the primers contains a hemagglutinin tag ( tt HA tag") as described 
above. 

Suitable primers include that following, which are used in this example. 
The 5* primer, containing the underlined BamHI ate, an AUG start codon and 6 
codons of the 5' coding region has the following sequence: 
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5 % CGC GGA TCC GCC ATC ATG GGA GCA ACC ACC ATG (SEQ ID 
NO: 9) 

The 3' primer, containing complementary sequences to the BamHI site, a 
stop codon, 9 codons thereafter forming the hemagglutinin HA tag, and 18 bp of 
S 3 1 coding sequence (at the 3' end) has the following sequence: 

5' CGC GGA TCC TCA AGC GTA GTC TGG GAC GTC GTA TGG GTA 
CAG GGA GCC AAG GCT TCT (SEQ ID NO: 10) 

The PCR amplified DNA fragment and the vector, pcDNAI/Amp, are 
digested with BamHI and then ligated. The ligation mixture is transformed into 
10 E. coli strain SURE (available from Stratagene Cloning Systems, 1 1099 North 

Torrey Pines Road, La Jolla, CA 92037), and the transformed culture is plated on 
ampicillin media plates which then are incubated to allow growth of ampiciliin 
resistant colonies. Plasmid DNA is isolated from resistant colonies and examined 
by restriction analysis and gel sizing for the presence of the chemokine alpha- 
15 encoding fragment. 

For expression of recombinant chemokine alpha, COS cells are transfected 
with an expression vector, as described above, using DEAE-DEXTRAN, as 
described, for instance, in Sambrook, J. et a/., Molecular Cloning: A Laboratory 
Manual* Cold Spring Laboratory Press, Cold Spring Harbor, New York (1989). 
20 Cells are incubated under conditions for expression of chemokine alpha by the 

vector. 

Expression of the chemokine alpha HA fusion protein is detected by 
radiolabelling and immunoprecipitation, using methods described in, for example 
Harlow et aL, Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring Harbor 

25 Laboratory Press, Cold Spring Harbor, New York (1 988). To this end, two days 

after transfection, the cells are labeled by incubation in media containing 35 S- 
cysteine for 8 hours. The cells and the media are collected, and the cells are 
washed and the lysed with detergent-containing RIP A buffer: ISO mM NaCl, 1% 
NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as described 

30 by Wilson et al. cited above. Proteins are precipitated from the cell lysate and 

from the culture media using an HA-specific monoclonal antibody. The 
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precipitated proteins then are analyzed by SDS-PAGE gels and autoradiography. 
An expression product of the expected size is seen in the cell lysate, which is not 
seen in negative controls. 

Example 3(b): Cloning and Expression in CHO Cells 

5 The DNA sequence encoding chitinase alpha protein is amplified using 

PCR oligonucleotide primers specific to the amino terminal sequence of the 
protein and to vector sequences 3 ' to the gene. Additional nucleotides containing 
restriction sites to facilitate cloning are added to the 5' and 3' sequences 
respectively. The restrictions sites are convenient to restriction enzyme sites in the 

10 CHO expression vector PC4. 

The amplified chitinase alpha protein DNA and the vector PC4 both are 
digested with BamHI and the digested DNAs then iigated together. Insertion of 
the chitinase alpha protein DNA into the BamHI restricted vector places the 
protein coding region downstream of and operably linked to the vector's 

15 promoter. The ligation mixture is transformed into competent CHO cells using 

standard procedures as described, for example, in Sambrook et al. y 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed.; Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y, (1989). 
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Example 4 



Expression Pattern of Chitinase Alpha in Human Tissue 

Northern blot analysis is carried out to examine chitinae alpha gene 
expression in human tissues, using methods described by, among others, 
Sambrook et a/., cited above. A cDNA probe containing the entire nucleotide 
sequence of the chitinase alpha protein (SEQ ID NO:l) is labeled with M P using 
the m/iprime™ DNA labeling system (Amersham Ufe Science), according to 
manufacturer's instructions. After labelling, the probe is purified using a 
CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), according to 
manufacturer's protocol number PT1 200-1. The purified labelled probe is then 
used to examine various human tissues for chitinase alpha mRNA. 

Multiple Tissue Northern (MTN) blots containing various human tissues 
(H) or human immune system tissues (IM) are obtained from Clontech and are 
examined with labelled probe using ExpressHyb™ hybridization solution 
(Clontech) according to manufacturer's protocol number PT1 190-1. Fallowing 
hybridization and washing, the blots are mounted and exposed to film at *70°C 
overnight, and films developed according to standard procedures. 

Numerous modifications and variations of the present invention are 
possible in light of the above teachings and, therefore, within the scope of the 
appended claims, the invention may be practiced otherwise than as particularly 
described. 

The disclosures of all patents, patent applications and publications referred 
to herein are hereby incorporated by reference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: HUMAN GENOME SCIENCES, INC. 

9410 KEY WEST AVENUE 
ROCKVILLE, MD 20850 
APPLICANT/ INVENTORS: THOTAKURA, NAGESWARA RAO 

CHOPRA, ARVIND 
CHOI, GIL 
GENTZ, REINER 
ROSEN, CRAIG 

(ii) TITLE OF INVENTION: CHITINASE 

(iii) NUMBER OF* SEQUENCES : 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX, P.L.L.C. 

(B) STREET: 1100 NEW YORK AVENUE, SUITE 600 

(C) CITY: WASHINGTON 

(D) STATE: D.Ci 
<E) COUNTRY,: US 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk ^ 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: TBA 
<B) FILING DATE: 09- AUG- 96 
(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: GOLDSTEIN, JORGE A. 
(B) REGISTRATION NUMBER: 29,021 
(C) REFERENCE /DOCKET NUMBER: 1488.046PCO0 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



(2) INFORMATION FOR SEQ ID NO:l: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1595~ base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 82.. 1344 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GAATTCGGCA CGAGGGCTGT CGAAACCTCA GTGGATAAAA GACCTAGAGA ATGTGTATCC 

CAGAAGAAGC TGGCCAAGGA T ATG GGA GCA ACC ACC ATG GAC CAG AAG TCT 

Met Gly Ala Thr Thr Met Asp Gin Lys Ser 
1 5 10 

CTC TGG GCA GGT GTA GTG GTC TTG CTG CTT CTC CAG GGA GAG ATG GGG 
Leu Trp Ala Gly Val Val Val Leu Leu Leu Leu Gin Gly Glu Met Gly 
15 20 . 25 

TTT TGC TAT GTT GCC AGA GCT GGT CTT GAA CTC CTG GGC TCA AGA AGT 
Phe Cys Tyr Val Ala Arg Ala Gly Leu Glu Leu Leu Gly Ser Arg Ser 
30 35 40 ^ 

CCT CCT GCC TCA GCC TCC CAA AGT GCT GGG ATA ACA GGA TCT GCC TAC 
Pro Pro Ala Ser Ala Ser Gin Ser Ala Gly lie Thr Gly Ser Ala Tyr 
45 50 55 

AAA CTG GTT TGC TAC TTT ACC AAC TGG TCC CAG GAC CGG CAG GAA CCA 
Lys Leu Val Cys Tyr Phe Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro 
60 65 70 

GGA AAA TTC ACC CCT GAG AAT ATT GAC CCC TTC CTA TGC TCT CAT CTC 
Gly Lys Phe Thr Pro Glu Asn lie Asp Pro Phe Leu Cys Ser His Leu 
75 80 85 90 

ATC TAT TCA TTC GCC AGC ATC GAA AAC AAC AAG GTT ATC ATC AAG GAC 
lie Tyr Ser Phe Ala Ser lie Glu Asn Asn Lys Val lie lie Lys Asp 
95 100 105 

AAG AGT GAA GTG ATG CTC TAC CAG ACC ATC AAC AGT CTC AAA ACC AAG 
Lys Ser Glu Val Met Leu Tyr Gin Thr lie Asn Ser Leu Lys Thr Lys 
110 115 120 

AAT CCC AAA CTG AAA ATT CTC TTG TCC ATT GGA GGG TAC CTG TTT GGT 
Asn Pro Lys Leu Lys lie Leu Leu Ser lie Gly Gly Tyr Leu Phe Gly 
125 130 135 



TCC AAA GGG TTC CAC CCT ATG GTG GAT TCT TCT ACA TCA CGC TTG GAA 
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Ser Lys Gly Phe His Pro Met Val Asp Ser Ser Thr Ser Arg Leu Glu 
140 145 150 

TTC ATT AAC TCC ATA ATC CTG TTT CTG AGG AAC CAT AAC TTT GAT GGA 5 91 

Phe lie Asn Ser lie lie Leu Phe Leu Arg Asn His Asn Phe Asp Gly 
155 160 165 170 

CTG GAT GTA AGC TGG ATC TAC CCA GAT CAG AAA GAA AAC ACT CAT TTC 63 9 

Leu Asp Val Ser Trp lie Tyr Pro Asp Gin Lys Glu Asn Thr His Phe 
175 180 185 

ACT GTG CTG ATT CAT GAG TTA GCA GAA GCC TTT CAG AAG GAC TTC ACA 68 7 

Thr Val Leu lie His Glu Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr 
190 195 200 

AAA TCC ACC AAG GAA AGG CTT CTC TTG ACT GCG GGC GTA TCT GCA GGG 735 
Lys Ser Thr Lys Glu Arg Leu Leu Leu Thr Ala Gly Val Ser Ala Gly 
205 * 210 215 

AGG CAA ATG ATT GAT AAC AGC TAT CAA GTT GAG AAA CTG GCA AAA GAT 783 
Arg Gin Met lie Asp Asn Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp 
220 225 230 

CTG GAT TTC ATC AAC CTC CTG TCC TTT GAC TTC CAT GGG TCT TGG GAA 831 
Leu Asp Phe lie Asn Leu Leu Ser Phe Asp Phe His Gly Ser Trp Glu 
235 240 245 250 

AAG CCC CTT ATC ACT GGC CAC AAC AGC CCT CTG AGC AAG GGG TGG CAG 87 9 

Lys Pro Leu lie Thr Gly His Asn Ser Pro Leu Ser Lys Gly Trp Gin 
255 260 265 

GAC AGA GGG CCA AGC TCC TAC TAC AAT GTG GAA TAT GCT GTG GGG TAC 927 
Asp Arg Gly Pro Ser Ser Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr 
270 275 280 

TGG ATA CAT AAG GGA ATG CCA TCA GAG AAG GTG GTC ATG GGC ATC CCC 975 
Trp lie His Lys Gly Met Pro Ser Glu Lys Val Val Met Gly lie Pro 
285 290 295 

ACA TAT GGG CAC TCC TTC ACA CTG GCC TCT GCA GAA ACC ACC GTG GGG 1023 
Thr Tyr Gly His Ser Phe Thr Leu Ala Ser Ala Glu Thr Thr Val Gly 
300 305 310 

GCC CCT GCC TCT GGC CCT GGA GCT GCT GGA CCC ATC ACA GAG TCT TCA 1071 
Ala Pro Ala Ser Gly Pro Gly Ala Ala Gly Pro lie Thr Glu Ser Ser 
315 320 325 330 

GGC TTC CTG GCC TAT TAT GAG ATC TGC CAG TTC CTG AAA GGA GCC AAG 1119 
Gly Phe Leu Ala Tyr Tyr Glu lie Cys Gin Phe Leu Lys Gly Ala Lys 
335 340 345 

ATC ACG AGG CTC CAG GAT CAG CAG GTT CCC TAC GCA GTC AAG GGG AAC 1167 
lie Thr Arg Leu Gin Asp Gin Gin Val Pro Tyr Ala Val Lys Gly Asn 
350 355 360 
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CAG TGG GTG GGC TAT GAT GAT GTG AAG AGT ATG GAG ACC AAG GTT CAG 12: 
Gin Trp Val Gly Tyr Asp Asp Val Lys Ser Met Glu Thr Lys Val Gin 
365 370 375 

TTC TTA AAG AAT TTA AAC CTG GGA GGA GCC ATG ATC TGG TCT ATT GAC 12 1 

Phe Leu Lys Asn Leu Asn Leu Gly Gly Ala Met lie Trp Ser lie Asp 
380 385 390 

ATG GAT GAC TTC ACT GGC AAA TCC TGC AAC CAG GGC CCT TAC CCT CTT . 13: 

Met Asp Asp Phe Thr Gly Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu 
395 400 405 410 

GTC CAA GCA GTC AAG AGA AGC CTT GGC TCC CTG TGAAGGATTA ACTTACAGAG 13 < 

Val Gin Ala Val Lys Arg Ser Leu Gly Ser Leu 
415 420 

AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT CTCATGTGGG 14: 

ATTCCCCTTG CCAGGCCGGC CTTTGGATCT CT C TTC C AAG CCTTTCCTGA CTTCCTCTTA 141 

GATCATAGAT TGGACCTGGT TTTGTTTTCC TGCAGCTGAT GACTTGTTGC CCTGAAGTAC 15- 

AATAAAAAAA ATTCATTTTG CTCCAGTAAA AAAAAAAAAA AAAAACTCGA G 15! 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 amino acids *%, 
<B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 : 

Met Gly Ala Thr Thr Met Asp Gin Lys Ser Leu Trp Ala Gly Val Val 
1 5 10 15 

Val Leu Leu Leu Leu Gin Gly Glu Met Gly Phe Cys Tyr Val Ala Arg 
20 25 30 

Ala Gly Leu Glu Levi Leu Gly Ser Arg Ser Pro Pro Ala Ser Ala Ser 
35 40 45 

Gin Ser Ala Gly lie Thr Gly Ser Ala Tyr Lys Leu Val Cys Tyr Phe 
50 55 60 

Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro Gly Lys Phe Thr Pro Glu 
65 70 75 . 80 

Asn He Asp Pro Phe Leu Cys Ser His Leu He Tyr Ser Phe Ala Ser 
85 90 95 

He Glu Asn Asn Lys Val lie He Lys Asp Lys Ser Glu Val Met Leu 
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100 - 105 HO 

Tyr Gin Thr lie Asn Ser Leu Lys Thr Lys Asn Pro Lys Leu Lys lie 
115 120 125 

Leu Leu Ser lie Gly Gly Tyr Leu Phe Gly Ser Lys Gly Phe His Pro 
130 135 140 

Met Val Asp Ser Ser Thr Ser Arg Leu Glu Phe lie Asn Ser He He 
145 150 155 160 

Leu Phe Leu Arg Asn His Asn Phe Asp Gly Leu Asp Val Ser Trp He 
165 170 175 

Tyr Pro Asp Gin Lys Glu Asn Thr His Phe Thr Val Leu He His Glu 
180 IBS 190 

Leu Ala Glu Ala Phe* Gin Lys Asp Phe Thr Lys Ser Thr Lys Glu Arg 
195 200 205 

Leu Leu Leu Thr Ala Gly Val Ser Ala Gly Arg Gin Met He Asp Asn 
210 215 220 

Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp Leu Asp Phe He Asn Leu 
225 230 235 240 

Leu Ser Phe Asp Phe His Gly Ser Trp Glu Lys Pro Leu He Thr Gly 
245 250 25§ 

His Ash Ser Pro Leu Ser Lys Gly Trp Gin Asp Arg Gly Pro Ser Ser 
260 265 270 

Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr Trp He His Lys Gly Met 
275 280 285 

Pro Ser Glu Lys Val Val Met Gly He Pro Thr Tyr Gly His Ser Phe 
290 295 300 

Thr Leu Ala Ser Ala Glu Thr Thr Val Gly Ala Pro Ala Ser Gly Pro 
305 310 315 320 

Gly Ala Ala Gly Pro He Thr Glu Ser Ser Gly Phe Leu Ala Tyr Tyr 
325 330 335 

Glu He Cys Gin Phe Leu Lys Gly Ala Lys He Thr Arg Leu Gin Asp 
340 345 350 

Gin Gin Val Pro Tyr Ala Val Lys Gly Asn Gin Trp Val Gly Tyr Asp 
355 360 365 

Asp Val Lys Ser Met Glu Thr Lys Val Gin Phe Leu Lys Asn Leu Asn 
370 375 380 

Leu Gly Gly Ala Met He Trp Ser He Asp Met Asp Asp Phe Thr Gly 
385 390 395 400 
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Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu Val Gin Ala Val Lys Arg 
405 410 415 

Ser Leu Gly Ser Leu 
420 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE:- 

(A) NAME /KEY : CDS 

(B) LOCATION: 82 1344 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAATTCGGCA CGAGGGCTGT CGAAACCTCA GTGGATAAAA GACCTAGAGA ATGTGTATCC € 

CAGAAGAAGC TGGCCAAGGA T ATG GGA GCA ACC ACC ATG GAC CAG AAG TCT 13 

Met Gly Ala Thr Thr Met Asp Gin Lys Ser 
1 5 % 10 

CTC TGG GCA GGT GTA GTG GTC TTG CTG CTT CTC CAG GGA GAG ATG GGG IE 
Leu Trp Ala Gly Val Val Val Leu Leu Leu Leu Gin Gly Glu Met Gly 
15 20 25 

TTT TGC TAT GTT GCC AGA GCT GGT CTT GAA CTC CTG GGC TCA AGA AGT 20 
Phe Cys Tyr Val Ala Arg Ala Gly Leu Glu Leu Leu Gly Ser Arg Ser 
30 35 40 

CCT CCT GCC TCA GCC TCC CAA AGT GCT GGG ATA ACA GGA TCT GCC TAC 2 5 

Pro Pro Ala Ser Ala Ser Gin Ser Ala Gly lie Thr Gly Ser Ala Tyr . 
45 50 55 

AAA CTG GTT TGC TAC TTT ACC AAC TGG TCC CAG GAC CGG CAG GAA CCA 30 
Lys Leu Val Cys Tyr Phe Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro 
60 65 70 

GGA AAA TTC ACC CCT GAG AAT ATT GAC CCC TTC CTA TGC TCT CAT CTC 35 
Gly Lys Phe Thr Pro Glu Asn He Asp Pro Phe Leu Cys Ser His Leu 
75 80 85 90 

ATC TAT TCA TTC GCC AGC ATC GAA AAC AAC AAG GTT ATC ATC AAG GAC 3 9 

He Tyr Ser Phe Ala Ser He Glu Asn Asn Lys Val He He Lys Asp 
SS 100 105 



AAG AGT GAA GTG ATG CTC TAC CAG ACC ATC AAC AGT CTC AAA ACC AAG 



44 



WO 98/06859 PCT/US96/13003 



-62- 

Lys Ser Glu Val Met Leu Tyr Gin Thr lie Asn Ser Leu Lys Thr Lys 
110 115 120 

AAT CCC AAA CTG AAA ATT CTC TTG TCC ATT GGA GGG TAC CTG TTT GGT 4 95 

Asn Pro Lys Leu Lys lie Leu Leu Ser lie Gly Gly Tyr Leu Phe Gly 
125 130 135 

TCC AAA GGG TTC CAC CCT ATG GTG GAT TCT TCT ACA TCA CGC TTG GAA 54 3 

Ser Lys Gly Phe His Pro Met Val Asp Ser Ser Thr Ser Arg Leu Glu 
140 145 150 

TTC ATT AAC TCC ATA ATC CTG TTT CTG AGG AAC CAT AAC TTT GAT GGA 591 
Phe lie Asn Ser lie lie Leu Phe Leu Arg Asn His Asn Phe Asp Gly 
155 160 165 170 

CTG GAT GTA AGC TGG GAA TAC CCA GAT CAG AAA GAA AAC ACT CAT TTC 639 
Leu Asp Val Ser Trp Glu Tyr Pro Asp Gin Lys Glu Asn Thr His Phe 
175". 180 185 

ACT GTG CTG ATT CAT GAG TTA GCA GAA GCC TTT CAG AAG GAC TTC ACA 687 
Thr Val Leu lie His Glu Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr 
190 195 200 

AAA TCC ACC AAG GAA AGG CTT CTC TTG ACT GCG GGC GTA TCT GCA GGG 735 
Lys Ser Thr Lys Glu Arg Leu Leu Leu Thr Ala Gly Val Ser Ala Gly 
205 210 215 

AGG CAA ATG ATT GAT AAC AGC TAT CAA GTT GAG AAA CTG GCA AAA GAT 7 83 

Arg Gin Met lie Asp Asn Ser Tyr Gin Val Glu Lys Leu Ala Lys v Asp 
221) 225 230 

CTG GAT TTC ATC AAC CTC CTG TCC TTT GAC TTC CAT GGG TCT TGG GAA 831 
Leu Asp Phe lie Asn Leu Leu Ser Phe Asp Phe His Gly Ser Trp Glu 
235 240 245 250 

AAG CCC CTT ATC ACT GGC CAC AAC AGC CCT CTG AGC AAG GGG TGG CAG 879 
Lys Pro Leu lie Thr Gly His Asn Ser Pro Leu Ser Lys Gly Trp Gin 
255 260 265 

GAC AGA GGG CCA AGC TCC TAC TAC AAT GTG GAA TAT GCT GTG GGG TAC 927 
Asp Arg Gly Pro Ser Ser Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr 
270 275 280 

TGG ATA CAT AAG GGA ATG CCA TCA GAG AAG GTG GTC ATG GGC ATC CCC 975 
Trp lie His Lys Gly Met Pro Ser Glu Lys Val Val Met Gly lie Pro 
285 290 295 

ACA TAT GGG CAC TCC TTC ACA CTG GCC TCT GCA GAA ACC ACC GTG GGG 1023 
Thr Tyr Gly His Ser Phe Thr Leu Ala Ser Ala Glu Thr Thr Val Gly 
300 305 310 

GCC CCT GCC TCT GGC CCT GGA GCT GCT GGA CCC ATC ACA GAG TCT TCA 1071 
Ala Pro Ala Ser Gly Pro Gly Ala Ala Gly Pro lie Thr Glu Ser Ser 
315 320 325 330 
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GGC TTC CTG GCC TAT TAT GAG ATC TGC CAG TTC CTG AAA GGA GCC AAG 11- 

Gly Phe Leu Ala Tyr Tyr Glu lie Cys Gin Phe Leu Lys Gly Ala Lys 
335 340 345 

ATC ACG AGG CTC CAG GAT CAG CAG GTT CCC TAC GCA GTC AAG GGG AAC 11 € 

lie Thr Arg Leu Gin Asp Gin Gin Val Pro Tyr Ala Val Lys Gly Asn 
350 355 360 

CAG TGG GTG GGC TAT GAT GAT GTG AAG AGT ATG GAG ACC AAG GTT CAG 123 
Gin Trp Val Gly Tyr Asp Asp Val Lys Ser Met Glu Thr Lys Val Gin 
365 370 375 

TTC TTA AAG AAT TTA AAC CTG GGA GGA GCC ATG ATC TGG TCT ATT GAC 12 € 

Phe Leu Lys Asn Leu Asn Leu Gly Gly Ala Met He Trp Ser He Asp 
380 385 390 

ATG GAT GAC TTC ACT GGC AAA TCC TGC AAC CAG GGC CCT TAC CCT CTT 133 
Met Asp Asp Phe Thr Gly Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu 
395 400 405 410 

GTC CAA GCA GTC AAG AGA AGC CTT GGC TCC CTG TGAAGGATTA ACTTACAGAG 13 6 

Val Gin Ala Val Lys Arg Ser Leu Gly Ser Leu 
415 420 

AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT CTCATGTGGG 1*4 S 

ATTCCCCTTG CCAGGCCGGC CTTTGGATCT CTCTTCCAAG CCTTTC CTGA CTTCCTCTTA 14 £ 

GATCATAGAT TGGACCTGGT TTTGTTTTCC TGCAGCTGAT GACTTGTTGC CCTGAAGTAC 154 

AATAAAAAAA ATTCATTTTG CTCCAGTAAA AAAAAAAAAA AAAAACTCGA G 155 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Gly Ala Thr Thr Met Asp Gin Lys Ser Leu Trp Ala Gly Val Val 
1 5 10 15 

Val Leu Leu Leu Leu Gin Gly Glu Met Gly Phe Cys Tyr Val Ala Arg 
20 25 30 

Ala Gly Leu Glu Leu Leu Gly Ser Arg Ser Pro Pro Ala Ser Ala Ser 
35 40 45 

Gin Ser Ala Gly He Thr Gly Ser Ala Tyr Lys Leu Val Cys Tyr Phe 
50 55 60 
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Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro Gly Lys Phe Thr Pro Glu 
65 70 75 80 

Asn lie Asp Pro Phe Leu Cys Ser His Leu lie Tyr Ser Phe Ala Ser 
85 90 95 

lie Glu Asn Asn Lys Val lie He Lys Asp Lys Ser Glu Val Met Leu 
100 105 no 

Tyr Gin Thr He Asn Ser Leu Lys Thr Lys Asn Pro Lys Leu Lys He 
115 120 125 

Leu Leu Ser He Gly Gly Tyr Leu Phe Gly Ser Lys Gly Phe His Pro 
130 135 140 

Met Val Asp Ser Ser Thr Ser Arg Leu Glu Phe He Asn Ser He lie 
145 150 155 i 6 o 

Leu Phe Leu Arg Asn His Asn Phe Asp Gly Leu Asp Val Ser Trp Glu 
Ifr5 170 175 

Tyr Pro Asp Gin Lys . Glu Asn Thr His Phe Thr Val Leu He His Glu 
180 185 190 

Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr Lys Ser Thr Lys Glu Arg 
195 200 205 

Leu Leu Leu Thr Ala Gly Val Ser Ala Gly Arg Gin Met He Asp Asn 
210 215 220 

Ser Tyr Gin Val Glu Lys Leu Ala Lys Asp Leu Asp Phe He Asn Leu 
225 230 235 240 

Leu Ser Phe Asp Phe His Gly Ser Trp Glu Lys Pro Leu He Thr Gly 
245 250 255 

His Asn Ser Pro Leu Ser Lys Gly Trp Gin Asp Arg Gly Pro Ser Ser 
260 265 270 

Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr Trp He His Lys Gly Met 
275 280 285 

Pro Ser Glu Lys Val Val Met Gly He Pro Thr Tyr Gly His Ser Phe 
290 295 300 

Thr Leu Ala Ser Ala Glu Thr Thr Val Gly Ala Pro Ala Ser Gly Pro 
305 310 315 320 

Gly Ala Ala Gly Pro He Thr Glu Ser Ser Gly Phe Leu Ala Tyr Tyr 
325 330 335 

Glu He Cys Gin Phe Leu Lys Gly Ala Lys He Thr Arg Leu Gin Asp 
340 345 350 



Gin Gin Val Pro Tyr Ala Val Lys Gly Asn Gin Trp Val Gly Tyr Asp 
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355 . 360 365 

Asp Val Lys Ser Met Glu Thr Lys Val Gin Phe Leu Lys Asn Leu Asn 
370 375 380 



Leu Gly Gly Ala Met He Trp Ser He Asp Met Asp Asp Phe Thr Gly 
385 390 395 400 

Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu Val Gin Ala Val Lys Arg 
405 410 415 

Ser Leu Gly Ser Leu 
420 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TY^E: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CGCGGATCCA TGGGAGCAAC CACCATGGAC CAG 
(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 6 : 
CGCGGATCCG CTTCTCTGTA AGTTAATCCT 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 

CGCGGATCCG CCATCATGGG AGCAACCACC ATG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CGCGGATCCG CTTCTCTGTA AGTTAATCCT 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGCGGATCCG CCATCATGGG AGCAACCACC ATG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGCGGATCCT CAAGCGTAGT CTGGGACGTC GTATGGGTAC AGGGAGCCAA GGCTTCT 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE : cDNA 



(ix) FEATURE 

(A) NAME /KEY : CDS 

(B> LOCATION: 27.. 1181 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGAATTCGGC ACGAGGAGCA ACCACC ATG GAC CAG AAG TCT CTC TGG GCA GGT i 

Met Asp Gin Lys Ser Leu Trp Ala Gly 
1 5 

GTA GTG GTC TTG CTG CTT CTC CAG GGA GGA TCT GCC TAC AAA\ £TG GTT 1< 
Val Val Val Leu Leu Leu Leu Gin Gly Gly Ser Ala Tyr Lys Leu Val 
10 " 15 20 25 

TGC TAC TTT ACC AAC TGG TCC CAG GAC CGG CAG GAA CCA GGA AAA TTC 1* 
Cys Tyr Phe Thr Asn Trp Ser Gin Asp Arg Gin Glu Pro Gly Lys Phe 
30 35 40 

ACC CCT GAG AAT ATT GAC CCC TTC CTA TGC TCT CAT CTC ATC TAT TCA 1! 
Thr Pro Glu Asn lie Asp Pro Phe Leu Cys Ser His Leu lie Tyr Ser 
45 50 55 

TTC GCC AGC ATC GAA AAC AAC AAG GTT ATC ATC AAG GAC AAG AGT GAA 2* 
Phe Ala Ser lie Glu Asn Asn Lys Val lie lie Lys Asp Lys Ser Glu 
60 65 70 

GTG ATG CTC TAC CAG ACC ATC AAC AGT CTC AAA ACC AAG AAT CCC AAA 2: 
Val Met Leu Tyr Gin Thr lie Asn Ser Leu Lys Thr Lys Asn Pro Lys 
75 80 85 

CTG AAA ATT CTC TTG TCC ATT GGA GGG TAC CTG TTT GGT TCC AAA GGG 3* 
Leu Lys lie Leu Leu Ser lie Gly Gly Tyr Leu Phe Gly Ser Lys Gly 
90 95 100 105 



TTC CAC CCT ATG GTG GAT TCT TCT ACA TCA CGC TTG GAA TTC ATT AAC 
Phe His Pro Met Val Asp Ser Ser Thr Ser Arg Leu Glu Phe lie Asn 
110 115 120 
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TCC ATA ATC CTG TTT CTG AGG AAC CAT AAC TTT GAT GGA CTG GAT GTA 437 
Ser lie lie Leu Phe "Leu Arg Asn His Asn Phe Asp Gly Leu Asp Val 
125 130 135 

AGC TGG ATC TAC CCA GAT CAG AAA GAA AAC ACT CAT TTC ACT GTG CTG 485 
Ser Trp He Tyr Pro Asp Gin Lys Glu Asn Thr His Phe Thr Val Leu 
140 145 150 

ATT CAT GAG TTA GCA GAA GCC TTT CAG AAG GAC TTC ACA AAA TCC ACC 533 
He His Glu Leu Ala Glu Ala Phe Gin Lys Asp Phe Thr Lys Ser Thr 
155 160 165 

AAG GAA AGG CTT CTC TTG ACT GCG GGC GTA TCT GCA GGG AGG CAA ATG 581 
Lys Glu Arg Leu Leu Leu Thr Ala Gly Val Ser Ala Gly Arg Gin Met 
170 175 180 185 

ATT GAT AAC AGC TAT CAA GTT GAG AAA CTG GCA AAA GAT CTG GAT TTC 629 
He Asp Asn Ser Tyr'.Gln Val Glu Lys Leu Ala Lys Asp Leu Asp Phe 
190, 195 200 

ATC AAC CTC CTG TCC TTT GAC TTC CAT GGG TCT TGG GAA AAG CCC CTT 677 
He Asn Leu Leu Ser .Phe Asp Phe His Gly Ser Trp Glu Lys Pro Leu 
205 210 215 

ATC ACT GGC CAC AAC AGC CCT CTG AGC AAG GGG TGG CAG GAC AGA GGG 725 
He Thr Gly His Asn Ser Pro Leu Ser Lys Gly Trp Gin Asp Arg Gly 
220 225 230 

CCA AGC TCC TAC TAC AAT GTG GAA TAT GCT GTG GGG TAC TGG ATA^CAT 773 
Pro Ser Ser Tyr Tyr Asn Val Glu Tyr Ala Val Gly Tyr Trp He His 
235 240 245 

AAG GGA ATG CCA TCA GAG AAG GTG GTC ATG GGC ATC CCC ACA TAT GGG 821 
Lys Gly Met Pro Ser Glu Lys Val Val Met Gly He Pro Thr Tyr Gly 
250 255 260 265 

CAC TCC TTC ACA CTG GCC TCT GCA GAA ACC ACC GTG GGG GCC CCT GCC 869 
His Ser Phe Thr Leu Ala Ser Ala Glu Thr Thr Val Gly Ala Pro Ala 
270 275 280 

TCT GGC CCT GGA GCT GCT GGA CCC ATC ACA GAG TCT TCA GGC TTC CTG 917 
Ser Gly Pro Gly Ala Ala Gly Pro He Thr Glu Ser Ser Gly Phe Leu 
285 290 295 

GCC TAT TAT GAG ATC TGC CAG TTC CTG AAA GGA GCC AAG ATC ACG AGG 965 
Ala Tyr Tyr Glu He Cys Gin Phe Leu Lys Gly Ala Lys He Thr Arg 
300 305 310 

CTC CAG GAT CAG CAG GTT CCC TAC GCA GTC AAG GGG AAC CAG TGG GTG 1013 
Leu Gin Asp Gin Gin Val Pro Tyr Ala Val Lys Gly Asn Gin Trp Val 
315 320 325 

GGC TAT GAT GAT GTG AAG AGT ATG GAG ACC AAG GTT CAG TTC TTA AAG 1061 
Gly Tyr Asp Asp Val Lys Ser Met Glu Thr Lys Val Gin Phe Leu Lys 
330 335 340 345 
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AAT TTA AAC CTG GGA GGA GCC ATG ATC TGG TCT ATT GAC ATG GAT GAC 110: 
Asn Leu Asn Leu Gly Gly Ala Met He Trp Ser He Asp Met Asp Asp 
350 355 360 

TTC ACT GGC AAA TCC TGC AAC CAG GGC CCT TAC CCT CTT GTC CAA GCA 115 
Phe Thr Gly Lys Ser Cys Asn Gin Gly Pro Tyr Pro Leu Val Gin Ala 
365 370 375 

GTC AAG AGA AGC CTT GGC TCC CTG TGAAGGATTA ACTTACAGAG AAGCAGGCAA 121 
Val Lys Arg Ser Leu Gly Ser Leu 
380 385 

GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCCAGGAATT CTCATGTGGG ATTCCCCTTG 127 

CCAGGCCGGC CTTTGGATCT CTCTTCCAAG CCTTTCC TG A CTTC CTCTTA GATCATAGAT 133 

TGGACCTGGT TTTGTTTTCC TGCAGCTGAT GACTTGTTGC CCTGAAGTAC AATAAAAAAA 139 

ATTCATTTTG CTCCAAAAAA AAAAAAAAAA AAAAACTCGA G 14 3 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein & c 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Met Asp Gin Lys Ser Leu Trp Ala Gly Val Val Val Leu Leu Leu Leu 
15 10 15 

Gin Gly Gly Ser Ala Tyr Lys Leu Val Cys Tyr Phe Thr Asn Trp Ser 
20 25 30 

Gin Asp Arg Gin Glu Pro Gly Lys Phe Thr Pro Glu Asn He Asp Pro 
35 40 45 

Phe Leu Cys Ser His Leu He Tyr Ser Phe Ala Ser He Glu Asn Asn 
50 55 60 

Lys Val He He Lys Asp Lys Ser Glu Val Met Leu Tyr Gin Thr He 
65 70 75 80 

Asn Ser Leu Lys Thr Lys Asn Pro Lys Leu Lys He Leu Leu Ser He 
85 .90 95 

Gly Gly Tyr Leu Phe Gly Ser Lys Gly Phe His Pro Met Val Asp Ser 
100 105 HO 

Ser Thr Ser Arg Leu Glu Phe He Asn Ser He He Leu Phe Leu Arg 
115 120 125 
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Asn His Asn Phe Asp 
130 

Lys Glu Asn Thr His 
145 

Phe Gin Lys Asp Phe 
165 

Ala Gly Val Ser Ala 
180 

Glu Lys Leu Ala Lys 
195 

Phe His Gly Ser Trp 
210 

Leu Ser Lys Gly Trp 
225 

Glu Tyr Ala Val Gly 
245 

Val Val Met Gly lie 
260 

Ala Glu Thr Thr Val 
275 

Pro Xle Thr Glu Ser 
290 

Phe Leu Lys Gly Ala 
305 

Tyr Ala Val Lys Gly 
325 

Met Glu Thr Lys Val 
340 

Met lie Trp Ser He 
355 

Gin Gly Pro Tyr Pro 
370 
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Gly Leu Asp Val Ser Trp 
135 

Phe Thr Val Leu He His 
150 155 

Thr Lys Ser Thr Lys Glu 
170 

Gly Arg Gin Met Xle Asp 
185 

Asp Leu Asp Phe He Asn 
200 

Glu Lys Pro Leu He Thr 
215 

Gin Asp Arg Gly Pro Ser 
230 235 

Tyr Trp He His Lys Gly 
250 

Pro Thr Tyr Gly His Ser 
265 

Gly Ala Pro Ala Ser Gly 
280 

Ser Gly Phe Leu Ala Tyr 
295 

Lys He Thr Arg Leu Gin 
310 315 

Asn Gin Trp Val Gly Tyr 
330 

Gin Phe Leu Lys Asn Leu 
345 

Asp Met Asp Asp Phe Thr 
360 

Leu Val Gin Ala Val Lys 
375 



He Tyr Pro Asp Gin 
140 

Glu Leu Ala Glu Ala 
160 

Arg Leu Leu Leu Thr 
175 

Asn Ser Tyr Gin Val 
190 

Leu Leu Ser Phe Asp 
205 

Gly His Asn Ser Pro 
220 

Ser Tyr Tyr Asn Val 
240 

Met Pro Ser Glu Lys 
255 

Phe Thr Leu Ala Ser 
270 

Pro Gly Ala Ala Gly 
285 

Tyr Glu He Cys Gin 
300 

Asp Gin Gin Val Pro 
320 

Asp Asp Val Lys Ser 
335 

Asn Leu Gly Gly Ala 
350 

Gly Lys Ser Cys Asn 
365 

Arg Ser Leu Gly Ser 
380 



Leu 
385 
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Inicmational applicaiionNo 
™P - 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bis) 



1 A. The indications made below relate to the microorganism referred to in toe description 

™ r . r 3 .... ' hnc • 


1 B. IDENTIFICATION OF DEPOSIT 


Further deposits arc identified on an additional sheet | | 


1 Name of depositary institution 




I American Type Culture Collection 




I Address of depositary institution (including postal code and country) 




123dl Parklawn Drive 
Rockville, Maryland 20852 
I United State of America 




1 Date of deposit 
| May 3, 1996 


Accession Number 

ATCC Designation 97529 


1 C. ADDITIONAL INDICATIONS (leave blank if mot appKcabte) This information is continued on an additional sheet Q 



ENA. plasmid 1290853 (Docket PF286) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for aU designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indication eg., 'Accession 
Number of Deposit*) 



For receiving Office use only 



rJ/T This sheet was received with the &ternalional application 

1 




ized officer 



For International Bureau use only 



P"] This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/KO/134 (July 1992) 



What Is Claimed Is: 
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1. An isolated nucleic acid molecule comprising a polynucleotide 
having a nucleotide sequence at least 95% identical to a sequence selected from 
the group consisting of: 

(a) a nucleotide sequence encoding the chitinase alpha 
polypeptide having the complete amino add sequence in Figure 1 [SEQ ID NO:2] 
or Figure 2 [SEQ ID NO:4]; 

(b) a nucleotide sequence encoding the mature chitinase alpha 
polypeptide having the amino acid sequence at positions 27 -421 in Figure 1 
[SEQ ID NO:2] or Figure 2 [SEQ ID NO:4]; 

(c) a nucleotide sequence encoding the chitinase alpha 
polypeptide having the complete amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 97529; 

(d) a nucleotide sequence encoding the mature chitinase alpha 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 97529; 

(e) a nucleotide sequence encoding the chitinase alpha-2 
polypeptide having the complete amino add sequence shown in Figure 5 [SEQ ID 
NO: 12]; 

(f) a nucleotide sequence encoding the mature chitinase alpha- 
2 polypeptide having the amino acid sequence at positions 21-385 in Figure 5 
[SEQ ID NO: 12]; and 

(g) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), (c), (d), (e), or (f). 

2. The nudeic acid molecule of claim 1 wherein said polynucleotide 
has the complete nudeotide sequence in Figure 1 [SEQ ID NO: 1 ], Figure 2 [SEQ 
ID NO:3], or Figure 5 [SEQ ID NO: 11]. 
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3. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the nucleotide sequence in Figure 1 [SEQ ID NO:l], Figure 2 [SEQ ID 
NO:3], or Figure 5 [SEQ ID ,NO:ll] encoding the polypeptide having the 
complete amino acid sequence in Figure 1 [SEQ ID NO:2], Figure 2 [SEQ ID 
NO:4], or Figure 5 [SEQ ID NO: 12], respectively. 

4. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the nucleotide sequence in Figure 1 (SEQ ID NO:l], Figure 2 [SEQ ID 
NO:3], or Figure 5 [SEQ ID NO:ll] encoding the mature chitinase alpha 
polypeptide having the amino acid sequence at positions 27 - 421 in Figure 1 
[SEQ ID NO:2] or Figure 2 [SEQ ID NO:4] or encoding the mature chitinase 
alpha-2 polypeptide having the amino acid sequence at positions 21-385 in Figure 
5 [SEQ ID NO: 12]. 

5. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the complete nucleotide sequence of the cDNA clone contained in ATCC 
Deposit No. 97529. ^ 

6. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the nucleotide sequence encoding the chitinase alpha polypeptide having the 
complete amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 97529. 

7. The nucleic acid molecule of claim 1 wherein said polynucleotide 
has the nucleotide sequence encoding the mature chitinase alpha polypeptide 
having the amino acid sequence encoded by the cDNA clone contained in ATCC 
Deposit No. 97529. 

8. An isolated nucleic acid molecule comprising a polynucleotide 
which hybridizes under stringent hybridization conditions to a polynucleotide 
having a nucleotide sequence identical to a nucleotide sequence in (a), (b), (c), (d), 



73 



(e), (0, or (g) of claim 1 wherein said polynucleotide which hybrizes does not 
hybridize under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence consisting of only A residues or of only T residues. 

9. An isolated nucleic acid molecule comprising a polynucleotide 
which encodes the amino acid sequence of an epitope-bearing portion of a 
chitinase alpha or dutinase alpha-2 polypeptide having an amino acid sequence in 
(a), (b), (c), (d), (e), or (f) of claim 1. 

10. The isolated nucleic acid molecule of claim 9, which encodes an 
epitope-bearing portion of a chitinase alpha or chitinase alpha-2 polypeptide 
selected from the group consisting of: a polypeptide comprising amino acid 
residues from about 38 to about 51 in Figure 1 (SEQ ID NO:2); a polypeptide 
comprising amino acid residues from about 66 to about 85 in Figure 1 (SEQ ID 
NO:2); a polypeptide comprising amino acid residues from about 1 17 to about 
127 in Figure 1 (SEQ ID NO: 2); a polypeptide comprising amino acid residues 
from about 145 to about 153 in Figure 1 (SEQ ID NO:2); a polypeptide 
comprising amino acid residues from about 165 to about 172 in Figure 1 (SEQ ID 
NO:2); a polypeptide comprising amino acid residues from about 177 to about 
185 in Figure 1 (SEQ ID NO:2); a polypeptide comprising amino add residues 
from about 245 to about 254 in Figure 1 (SEQ ID NO:2); a polypeptide 
comprising amino add residues from about 256 to about 274 in Figure 1 (SEQ ID 
NO:2); a polypeptide comprising amino acid residues from about 286 to about 
293 in Figure 1 (SEQ ID NO:2); a polypeptide comprising amino add residues 
from about 366 to about 376 in Figure 1 (SEQ ID NO: 2); and a polypeptide 
comprising amino add residues from about 394 to about 407 in Figure 1 (SEQ ID 
NO:2). 

11. A method for making a recombinant vector comprising inserting 
an isolated nucldc add molecule of claim 1 into a vector. 
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12. A recombinant vector produced by the method of claim 1 1 . 

13. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 12 into a host cell. 

14. A recombinant host cell produced by the method of claim 13. 

15. A recombinant method for producing a chitinase alpha or chitinase 
alpha-2 polypeptide, comprising culturing the recombinant host cell of claim 14 
under conditions such that said polypeptide is expressed and recovering said 
polypeptide. 

16. An isolated chitinase alpha or chitinase alpha-2 polypeptide having 
an amino acid sequence at least 95% identical to a sequence selected from the 
group consisting of: 

(a) the amino acid sequence of the chitinase alpha polypeptide 
having the complete amino acid sequence in Figure 1 [SEQ ID NO;2] or Figure 
2 [SEQ ID NO:4]; 

(b) the amino acid sequence of the mature chitinase alpha 
polypeptide having the amino acid sequence at positions 27-421 in Figure 1 
[SEQ ID NO:2] or Figure 2 [SEQ ID NO:4]; 

(c) the amino acid sequence of the chitinase alpha polypeptide 
having the complete amino acid sequence encoded by the cDNA clone contained 
in ATCC Deposit No. 97529; and 

(d) the amino acid sequence of the mature chitinase alpha 
polypeptide having the amino acid sequence encoded by the cDNA clone 
contained in ATCC Deposit No. 97529; 

(e) the amino acid sequence of the chitinase alpha-2 
polypeptide having the complete 385 amino acid sequence including the leader 
sequence shown in Figure 5 [SEQ ID NO: 12]; 
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(f) the amino acid sequence of the mature chitinase aIpha-2 
polypeptide (without the leader) having the amino acid sequence at positions 21- 
385 in Figure 5 [SEQ ID NO: 12]. and 

(e) the amino acid sequence of an epitope-bearing portion of 
any one of the polypeptides of (a), (b), (c), (d), (e), or (f). 

17. An isolated polypeptide comprising an epitope-bearing portion of 
chitinase alpha, wherein said portion is selected from the group consisting of: a 
polypeptide comprising amino acid residues from about 38 to about 5 1 in Figure 
1 (SEQ ID NO:2); a polypeptide comprising amino acid residues from about 66 
to about 85 in Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid 
residues from about 1 17 to about 127 in Figure 1 (SEQ ID NO:2); a polypeptide 
comprising amino acid residues from about 145 to about 153 in Figure 1 (SEQ ID 
NO:2); a polypeptide comprising amino acid residues from about 165 to about 
172 in Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues 
from about 177 to about 185 in Figure 1 (SEQ ID NO:2); a polypeptide 
comprising amino acid residues from about 245 to about 254 in Figure 1 (SEQ ID 
NO: 2); a polypeptide comprising amino acid residues from about 256 to about 
274 in Figure 1 (SEQ ID NO:2); a polypeptide comprising amino acid residues 
from about 286 to about 293 in Figure 1 (SEQ ID NO:2); a polypeptide 
comprising amino acid residues from about 366 to about 376 in Figure 1 (SEQ ID 
NO:2); and a polypeptide comprising amino add residues from about 394 to about 
407 in Figure 1 (SEQ ID NO:2). 

18. An isolated antibody that binds specifically to a chitinase alpha or 
chitinase alpha-2 polypeptide of claim 16. 

19. A method for treating a patient having need of anti-fungal therapy 
comprising administering to said patient a therapeutically effective amount of the 
polypeptide of claim 16. 
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20. A method for treating a patient having a tissue remodeling 
disorder, comprising administering to said patient a therapeutically effective 
amount of an antagonist of the polypeptide of claim 16. 
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10 30 50 

gaattcggcacgagggctgtcgaaacctcagtggataaaagacctagagaatgtgtatcc 
70 90- no 

cagaagaagctggccaaggatATGGGAGCAACCACCATGGACCAGAAGTCTCTCTGGGCA 

MGATTMOOICSI u 
130 150 170 

GGTGTAGTGGTCTTGCTGCTTCTCCAGGGAGAGATGGGGTTTTGCTATGTTGCCAGAGCT 
G V V V I L L 1 0 G EMGFCYVARA 
190 210 230 

GGTCTTGAACTCCTGGGCTCAAGAAGTCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATA 
GLELLGSRSPPASASQSAGI 
250 270 290 

ACAGGATCTGCCTACAMCTGGTTTGCTACTTTACCAACTGGTCCCAGGACCGGCAGGAA 
T G S A Y K L V C Y F T N W S Q D R Q E 
310 330 350 

CCAGGAAAATTCACCCCTGAGAATATTGACCCCTTCCTATGCTCTCATCTCATCTATTCA 
PGKFTPEN. IDPFLCSHL IYS 
370 390 410 

TTCGCCAGCATCGAAAACAACAAGGTTATCATCAAGGACAAGAGTGAAGTGATGCTCTAC 
FASIENNKVI IKDK SEVML Y 
430 450 470 

CAGACCATCAACAGTCTCAAAACCAAGAATCCCAAACTGAAAATTCTCTTGTCCATTGGA 
QTI NSL KTKNPK LK I LLSI G 

490 510 530 

GGGTACCTGTTTGGTTCCAMGGGTTCt^CCCTATGGTGGATTCTTCTACATCACGCTTG 
GYLFGSKGFHPMVDSSTSRL 

550 570 590 

GAATTCATTAACTCCATAATCCTGTTTCTGAG6AACCATAACTTTGATGGACTGGATGTA 
E F I N S I ILFLRNHNFDGLD V 

610 630 650 

AGCTGGATCTACCCAGATC^GAAAGAAAACACTCATTTCACTGTGCTGATTCATGAGTTA 
SWI YPDQKEN THFTVL I HEL 

670 690 710 

GCAGMGCCTTTCAGMGGACTTCACAAAATCCACCAAGGAAAGGCTTCTCTTGACTGCG 
AEAFQKDFTKSTKERLLLTA 

730 750 770 

GGCGTATCTGCAGGGAGGCAAATGATTGATAACAGCTATCAAGTTGAGAAACTGGCAAAA 
GVSAGRQM I DNSYQV EKLAK 

790 810 830 

GATCTGGATTTCATCMCCTCCTGTCCTTTGACTTCCATGGGTCTTGGGAAAAGCCCCTT 
DLDFI NLLSFDFH,GSWEKPL 
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"850 870 890 

ATCACTGGCCACAACAGCCCTCTGAGCAAGGGGTGGCAGGACAGAGGGCCAAGCTCCTAC 
ITGH NSPLSKGWQDRGPSSY 

910 930 950 

TACAATGTGGAATATGCTGTGG6GTACTGGATACATAAGGGAATGCCATCAGAGAAGGTG 
Y N V E YAVG YW I H KGM PSE K V 
970 990 1010 

GTCATGGGCATCCCCACATATGGGCACTCCTTCACACTGGCCTCTGCAGAAACCACCGTG 
VMGI PTYGHSFT LASAETTV 
103fc 1050 1070 

GGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGAGTCTTCAGGCTTCCTG 
GAPjASGPG AAGPITESSGFI 
1090 1110 1130 

GCCTATTATGAGATCTGCCAGTTCCTGAAAGGAGCCAAGATCACGAGGCTCCAGGATCAG 
AYYE I CQFLKG AKITRLQDQ 

1150 1170 1190 

CAGGTTCCCTACGCAGTCAAGGGGAACCAGTGGGTGGGCTATGATGATGTGAAGAGTATG 
QVP YAVKG NQWV GYDDVKSM 

, 1210 1230 1250 

GAGACCMGGTTCAGTTCTTAAAGAATTTAAACCTGGGAGGAGCCATGATCTGGTCTATT 
ETKVQFLKN LNLG GAMIWSI 

1270 1290 1310 

GACATGGATGACTTCACTGGCAMTCCTGCMCCAGGGCCCTTACCCTCTTGTCCAA(fcA 
DMDD F TGKSCNQGPYPLVQA 
1330 1350 1370 

GTCAAGAGAAGCCTTGGCTCCCTGTGAaggattaacttacagagaagcaggcaagatgac 
VKRSLGSL* 

1390 1410 1430 

cttgctgcctggggcctgctctctcccaggaattctcatgtgggattccccttgccaggc 

1450 1470 1490 

cggcctttggatctctcttccaagcctttcctgacttcctcttagatcatagattggacc 
1510 1530 1550 

tggttttgttttcctgcagctgatgacttgttgccctgaagtacaataaaaaaaattcat 

1570 1590 
tttgctccagtaaaaaaaaaaaaaaaaaactcgag 

FIG.1B 
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10 30 50 

gaattcggcacgagggctgtcgaaacctcagtggataaaagacctagagaatgtgtatcc 
70 90- 110 

cagaagaagctggccaaggatATGGGAGCAACCACCATGGACCAGAAGTCTCTCTGQGCA 

MGATTMDOKSl MA 

130 150 170 

GGTGTAGTGGTCTTGCTGCTTCTCCAGGGAGAGATGGGGTTTTGCTATGTTGCCAGAGCT 
G V V V I L 1 I 0 R EMGFCYVARA 

190 210 230 

GGTCTTGAACTCCTGGGCTCAAGAAGTCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATA 
GLELLGSRSPPASASQSAG I 

250 270 290 

ACAGGATCTGCCTACAMCTGGTTTGCTACTTTACCAACTGGTCCCAGGACCGGCAGGAA 
TGS.AYKLVCYFTN WSQD RQE 
310 330 350 

CCAGGAAAATTCACCCCTGAGAATATTGACCCCTTCCTATGCTCTCATCTCATCTATTCA 
PGKFTPENIDPFLCSHL IYS 

370 390 410 

TTCGCCAGCATCGAAAACAACAAGGTTATCATCAAGGACAAGAGTGAAGTGATGCTCTAC 
FASIENNKVIIKDKSEVMLY 

430 450 470 

CAGACCATCMCAGTCTCAAMCCMGMTCCCAMCTGAAMTTCTCTTGTCCATTGGA 
QT INSLKTKNPKLKILLSIG 

490 510 530 ^ 

GGGTACCTGmGGTTC(^GGGnC(^CCCTATGGTGGAmTTCTACATCACGCTTG 
GYLFGSKGFHPMVDSSTSRL 

550 570 590 

GMTTCATTMCTCCATMTCCTGTTTCTGAGGAACCATAACTTTGATGGACTGGATGTA 
EFINSI I LFLRNHNFDGLDV 

610 630 650 

AGCTGGGMTACCCAGAraGAMGAAMCACTCATTTCACTGTGCTGATTCATGAGTTA 
SWEYPDQKENTHFTVL I HEL 

670 690 710 

G(^GMGCCTTTCAGMGGACnCACAAMTCCAC(MGGAAAGGCnCTCnGACTGCG 
AEAFQKDFTKSTKERLLLTA 

730 750 770 

GGCGTATCTGCAGGGAGGCAMTGATTGATM(^CT^AT(^GTTGAGAMCTGGCAAAA 
GVSAGRQMIDNSYQVEKLAK 

790 810 830 

GATCTGGATTTCATCMCCTCCTGTCCTTTGACTTCCATG^TCTTGGGAAAAGCCCCTT 
DLDFI NLLSFDFHGSWEKPL 
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850 870 890 

ATCACTGGCCACAACAGCCCTCTGAGCAAGGGGTG6CA6GACAGA6GGCCAAGCTCCTAC 
ITGHNSPLSKGWQDRGPSSY 

910 930 950 

TACAATGTGGAATATGCTGTGGGGTACTGGATACATAAGGGAATGCCATCAGAGAAGGTG 
YNV EYAVGYWIHKGMPSEKV 

970 990 1010 

GTCATGGGCATCCCCACATATGGGCACTCCTTCACACTGGCCTCTGCAGAAACCACCGTG 
VMGI PTY GHSFTLASAETTV 

1030 1050 1070 

GGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGAGTCTTCAGGCTTCCTG 
GAPASGPGAAGPITESSGFL 

1090 1110 1130 

GCCTATTATGAGATCTGCCAGtTCCTGAAAGGAGCCAAGATCACGAGGCTCCAGGATCAG 
AYYE I€Q FLKGA K ITRLQDQ 

1150 1170 1190 

CAGGTTCCCTACGCAGTCAAGGGGAACCAGTGGGTGGGCTATGATGATGTGAAGAGTATG 
QVPYAVKGNQWVGYDDV K SM 

1210 1230 1250 

GAGACCMGGTTCAGTTCTTAAAGAATTTAAACCTGGGAGGAGCCATGATCTGGTCTATT 
ETKVQ F L KNLNLGGAM I W S I 

1270 1290 1310 

GACATGGATGACTTCACTGGCAAATCCTGCAACCAGGGCCCTTACCCTCTTGTCCAAGCA 
DMDDFTGKSCNQGPYPLVQA 

1330 1350 1370 

GTCAAGAGAAGCCTTGGCTCCCTGTGAaggattaacttacagagaagcaggcaagatgac 
VKRSLGSL* 

1390 1410 1430 

cttgctgcctggggcctgctctctcccaggaattctcatgtgggattccccttgccaggc 

1450 1470 1490 

cggcctttggatctctcttccaagcctttcctgacttcctcttagatcatagattggacc 

1510 1530 1550 

tggttttgttttcctgcagctgatgacttgttgccctgaagtacaataaaaaaaattcat 

1570 1590 
tttgctccagtaaaaaaaaaaaaaaaaaactcgag 
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10 30 50 

GGAATTCGGCACGAGGAGCAACCACCATGGACCAGAAGTCTCTCTGGGCAGGTGTAGTGG 

M D Q K S I W A fl v v y 
70 90 110 

TCnGCTGCTTCTCCAGGGAGGATCTGCCTACAMCTGGTnGCTACTTTACCAACTGGT 
I- I- L L Q G G $ AYKLVCYFTNWS 
130 150 170 

CCCAGGACCGGCAGGAACCAGGAAAATTCACCCCTGAGAATATTGACCCCTTCCTATGCT 

QDRQEPGKFTPENIDPFLCS 
190 210 230 

ctcatctcatctattcattcgccagcatcgaamcaacmggttatcatcaaggacaaga 
hliysfasiennkvi ikdks 

250 270 290 

GTGMGTGATGCTCTACCAGACCATCMCAGTCTCA^CGMGMTGCCAMCTGAAAA 
EVMLYQTINSLKTKNPKLKI 
310 330 350 

TTCTCTTGTCCATTGGAGGGTACCTGTTTGGTTCCAAAGGGTTCCACCCTATGGTGGATT 
LLS I GG YLFGSKGFHPMVD S 
370 390 410 

CTTCTACATCACGCTTGGMTTCATTAACTCCATMTCCTGTTTCTGAGGAACCATAACT 
STSRLEFI NSI ILFLRNHN F 
430 450 470 

nGATGGACTGGATGTMGCTGGATCTACCCAGATCAGAMGAAMCACTCATTTCACTG 
DGL D V SW I Y PDQ KE NTH FT%V 
490 510 530 

TGCTGATTCATGAGTTAGCAGMGCCTTTCAGAAGGACTTCACAAAATCCACCAAGGAAA 
LIHELAEAFQKDFTKSTKER 
550 570 590 

GGCTTCTCTTGACTGCGGGCGTATCTGCAGGGAGGCAAATGATTGATAACAGCTATCAAG 
LLLTAGVSAGRQMI D NSYQ V 
610 630 650 

TTGAGAAACTGGCAAMGATCTGGATTTCATCMCCTCCTGTCCTTTGACTTCCATGGGT 
EKLAKDLDF INLLSFDFHG S 
670 690 710 

CTTGGGAAAAGCCCCTTATCACTGGCCACAACAGCCCTCTGAGCAAGGGGTGGCAGGACA 
W EKPL ITGHNS PLSKGWQD R 
730 750 770 

GAGGGCCAAGCTCCTACTACAATGTGGAATATGCTGTGGGGTACTGGATACATAAGGGAA 
GPSSYYNVEYAVGYWIHKGM 
790 810 830 

TGCCATCAGAGAAGGTGGTCATGGGCATCCCCACATATGGGCACTCCTTCACACTGGCCt 
PSEKVVMGI PTYGHSFTLAS 
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850 870 890 

CTGCAGAAACCACCGTGGGGGCCCCTGCCTCTGGCCCTGGAGCTGCTG6ACCCATCACA6 
AETTVGA PASGPGAA'GP I TE 
910 930 950 

AGTCTTCAGGCTTCCTGGCCTATTATGAGATCTGCCAGTTCCTGAAAGGAGCCAAGATCA 
SSGFLAYYE ICQFLKGAKIT 
970 990 1010 J 

CGAGGCTCCAGGATCAGCAGGTTCCCTACGCAGTCAAGGGGAACCAGTGGGTGGGCTATG 
RLQDQQVPYAVKGNQWVGYD 
1030 1050 1070 

ATGATGTGAAGAGTATGGA6ACCAAGGTTCAGTTCTTAAAGAATTTAAACCTGGGAGGAG 
DVKSMETKVQFLKNLNLGGA 
1090 1110 1130 

CCATGATCTGGTCTATTGACATGGATGACTTCACTGGCAAATCCTGCAACCAGGGCCCTT 
MIWSIDM. DDFTGKSCNQGPY 
1150 1170 1190 , 

ACCCTCTTGTCCAAGCAGTCAAGAGAAGCCTTGGCTCCCTGTGAAGGATTAACTTACAGA 
PLVQAVKRSLGSL* 

1210 .1230 1250 

GAAGCAGGCAAGATGACCTTGCTGCCTGGGGCCTGCTCTCTCCCAGGAATTCTCATGTGG 

1270 ; 1290 1310 

GATTCCCCTTGC(^GGCCGGCCTTTGGATCTCTCTTC(^GCCTTTCCTGACTTCCTCTT 

1330' 1350 1370 

AGATCATAGATTGGACCTGGTTTTGTTTTCCTGCAGCTGATGACTTGTTGCCCTGAAGTA 

1390 1410 1430 

CMTAAAAAAMnCATTTTGCTCCAAAAAAAAAAAAAAAAAAAAACTCGAG 
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1 GGAATTC66CACGAGGAGCAACCACCATGGACCA6AAGTCTCTCTG 46 

J II MIMIIIIIIIIIIIIMIIIIIMIIIIM 

251 a agct ggcca agga t ATGGGAGCAACCACCATGGACCAGAAGTCTCTCTG 300 

47 GGCAGGTGTAGTGGTCTTGCTGCTTCTCCAGGG ' 79 

llimillllllllMiilllllllillllll 
301 GGCAGGTGTAGTGGTCTTGCTGCTTCTCCAGGGAGAGATGGGGTTTTGCT 350 



351 ATGTTGCCAGAGCTGGTCTTGAACTCCTGGGCTCAAGAAGTCCTCCTGCC 400 

80 • AGGATCTGGCTACAAACTGGTTTG 103 

401 TCAGCCTCCCAMGTGCTGGGATMCAG^^ 450 

104 CTACTTTACCMCTGGTCCCAGGACCGGCAGGAACCAGGAAAATTCACCC 153 

MlilMIIIIIIMilllllllllMMIIMMIMlMlliiMiii 

451 CTACTTTACCAACTGGTCCCAGGACCGGCAGGAACCAGGAAAATTCACCC 500 
154 CTGAGMTATTGACCCCTTCCTATGCTCTCATCTCATCTATTCATTCGCC 203 

iimiiiiiiiiiiiimriiiiiiiiiiiimiiiiiiiriiiiii 

501 CTGAGAATATTGACCCCTTCCTATGCTCTCATCTCATCTATTCATTCGCC v550 
204 AGCATCGAAAACAACAAGGTTATCATCAAGGACAAGAGTGAAGTGATGCT 253 

1 1 1 1 1 1 1 } 1 1 1 1 1 1 1 1 1 1 1 1 1 m f 1 f 1 1 1 j 1 1 1 [ 1 1 1 1 1 r f 1 1 f f 1 1 1 1 i 

551 AGCATCGAAAACAACAAGGTTATCATCAAGGACAAGAGTGAAGTGATGCT 600 
254 CTACCAGACOTCMCAGTCTCAAMCCMGMTCCCAAACTGAAAATTC 303 

lllillllNlllllllllllilllllllilllllllllllllllMlil 

601 CTACCAGACCATCAACAGTCTCAAAACCAAGAATCCCAAACTGAAAATTC 650 
304 TCTTGTCCATTGGAGGGTACCTGTTTGGTTCCAAAGGGTTCCACCCTATG 353 

1 1 1 1 1 M 1 1 ! 1 1 1 1 1 f 1 1 II M [ I M 1 1 1 1 M I j 1 1 1 1 1 1 f 1 1 1 1 1 M 1 1 

651 TCTTGTCCATTGGAGGGTACCTGTTTGGTTCCAAAGGGTTCCACCCTATG 700 
354 GTGGATTCTTCTACATCACGCTTGGM™TTAACTCCATAATCCTGTT 403 

IIIINIIIIIMIIIIIIIIIIIIIIilllliiiiiiiiiiiiiiMii 

701 GTGGATTCTTCTACATCACGCTTGGMTTCATTMCTCCATAATCCTGTT 750 . 
404 TCTGAGGAACCATAACTTTGATGGACTGGATGTAAGCTGGATCTACCCAG 453 

nr illlllllllililllllllllllllilllllllllllllllMIIIIII 

751 TCTGAGGAACCATAACTTTGATGGACTGGATGTAAGCTGGATCTACCCAG 800 

CII FIG.6A 
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454 ATCAGAAAGAAAACACTCATTTCACTGTGCTGATTCATGAGTTAG CAGAA 503 

lllllllllllllllllllllllllillllllilllllllllllllllll 
801 ATCAGAAAGAAAACACTCATTTCACTGTGCTGATTCATGAGTTAGCAGAA 850 

504 GCCTTTCAGMGGACTTCACAAAATCCACCAAGGAAAGGCTTCTCTTGAC 553 

llllllllltllllllllllllllllllllllllllllllllllilllll 
851 GCCTTTCAGAAGGACTTCACAAAATCCACCAAGGAAAGGCTTCTCTTGAC 900 

* * * • • 

554 TGCGGGCGTATCTGCAGGGAGGCAAATGATTGATAACAGCTATCAAGTTG 603 

lllillllillllllllllllllllMIIIIIIIIIIIIIIIIIIIIII! 
901 TGCGGGCGTATCTGCAGGGAGGCAAATGATTGATAACAGCTATCAAGTTG 950 

604 AGAMCTGGCAAMGATCTGGATTTCATCAACCTCCTGTCCTTTGACTTC 653 

irillllllllUIMIIIIilllllltllllllllillllMIMIIII 

951 AGAAACTGGCAAAAGATCTGGATTTCATCAACCTCCTGTCCTTTGACTTC 1000 

654 CATGGGTCTTGGGAAAAGCCCCTTATCACTGGCCACAACAGCCCTCTGAG 703 

llllllllillllliirillllllllllllllllllllllilllllllll 
1001 CATGGGTCTTGGGAAAAGCCCCTTATCACTGGCCACAACAGCCCTCTGAG 1050 

• • • • . 

704 CAAGGGGTGGCAGGACAGAGGGCCAAGCTCCTACTACAATGTGGAATATG 753 

IIIIIIIIIIMIIIIIIIIIIIIIIIilllllllllllMlllllllil , 

1051 CAAGGGGTGGCAGGACAGAGGGCCAAGCTCCTACTACAATGTGGAATATG 1100 
754 CTGTGGGGTACTGGATACATAAGGGAATGCCATCAGAGAAGGTGGTCATG 803 

linillMIIIIIIMIIIIIIIIIIIIUIIIIIIIlllillllllll 

1101 CTGTGGGGTACTGGATACATAAGGGAATGCCATCAGAGAAGGTGGTCATG 1150 

804 GGCATCCCCACATATGGGCACTCCTTCACACTGGCCTCTGCAGAAACCAC 853 

IIIIIIMIIIIilllllllMIIIIIIIIIIIIIIIIIIIIIIIIIIII 
1151 GGCATCCCCACATATGGGCACTCCTTCACACTGGCCTCTGCAGAAACCAC 1200 

854 CGTGGGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGAGT 903 

lllllllilllllllllMlllllllllllillllllllllllMIIIII 
1201 CGTGGGGGCCCCTGCCTCTGGCCCTGGAGCTGCTGGACCCATCACAGAGT 1250 

904 CTTCAGGCTTCCTGGCCTATTATGAGATCTGCCAGTTCCTGAAAGGAGCC 953 

IIIIIIIIIIIIINIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIII 
1251 CTTCAGGCTTCCTGGCCTATTATGAGATCTGCCAGTTCCTGAAAGGAGCC 1300 

954 AAGATCACGAGGCTCCAGGATCAGCAGGTTCCCTACGCAGTCMGGGGAA 1003 

lllllllllllllllllllllillllllllllllllllllltllllllll 
1301 AAGATCACGAGGCTCCAGGATCAGCAGGTTCCCTACGCAGTCAAGGGGAA 1350 
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1004 CCAGTGGGTGG6CTATGAT6ATGT6AAGAGTATG6AGACCAAGGTTCAGT 1053 
ioc JIN 1 1 1 1 1 1 f 1 1 ( N 1 1 1 f f I [ M I II 1 1 1 1 1 II 1 1 1 M 1 1 f 1 1 M 1 1 1 
1351 CCAGTGGGTGGGCTATGATGATGTGAAGAGTATGGAGACCAAGGTTCAGT 1400 

1054 TCTTAAAGAATTTAAACCTGGGAGGAGCCATGATCTGGTCTATTGACATG 1103 

MMIIIllllllllilllllllillllllllllllllllllllllilir 

1401 TCTTAAAGAATTTAAACCTGGGAGGAGCCATGATCTGGTCTATTGACATG 1450 
1104 GATGACTTCACTGGCAAATCCTGCAACCAGGGCCCTTACCCTCTTGTCCA 1153 

IMNIIIIIIIMIIIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIII 

1451 GATGACTTCACTGGCAAATCCTGCAACCAGGGCCCTTACCCTCTTGTCCA 1500 
1154 AGCAGTCMGAGMGCCTTGGCTCCCTGTGAAGGATTAACTTACAGAGAA 1203 

miiiiiimimiiiiiiiiiiiMiiiiiiiiiiiiiiimiii 

1501 AGCAGTCAAGAGAAGCCTTGGCTCCCTGTGAaggattaacttacagagaa 1550 
1204 GCAGGCAAGATGACCTTGCTGCCTGGGGCCTGCTCTCTCCCAGGAATTCT 1253 

rr llllllllillllllllllllllllllllllllllllllllllllllMI 
1551 gcaggcaagatgaccttgctgcctggggcctgctctctcccaggaattct 1600 

1254 CATGTGGGATTCCCCTTGCCAGGCCGGCCTTTGGATCTCTCTTCCAAGCC 1303 

IMIIIIIIIIIMIIIIIIIIIIIIMIMIMMIIIIMIIIIIII! 

1601 catgtgggattccccttgccaggccggcctttggatctctcttccaagcc 1650 
1304 TTTCCTGACTTCCTCTTAGATCATAGATT^CCTGGTTTTGTTTTCCTGi 1353 

rr 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 i 1 1 1 1 1 M t i 1 1 1 M 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 ! i 1 1 
1651 tttcctgacttcctcttagatcatagattggacctggttttgttttcctg 1700 

1354 CAGCTGATGACTTGTTGCCCTGMGTACMTAAAAAAMTTCATTTTGCT 1403 

IMIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

1701 cagctgatgacttgttgccctgaagtacaataaaaaaaattcattttgct 1750 
1404 CCAAAAAAAAAAAAAAAAAAAAACTCGAG. . . 1432 

llillllllllllllllllilllllllll 

1751 ccagtaaaaaaaaaaaaaaaaaactcgag '1779 



FIG.6C 



SUBSnTUTE S»m (RULE 26) 



WO 98/06859 



PCT/US96/13003 



12/12 



1 MDQKSLWAGVVVLLLLQG 18 

I ! I [ 1 1 1 M 1 1 1 1 1 ! 1 1 1 

1 MGATTMDQKSLWAGVVVLLLLQGEMGFCYVARAGLELLGSRSPPASASQS . 50 
19 . . . . GSAYKLVCYFTNWSQDRQEPGKFTPENIDPFLCSHL I YSFASI ENN 64 

I f 1 1 M I II 1 1 M 1 1 1 f I! 1 1 M M I M If I M f t M 1 1 1 1 1 [ 1 M 

51 AG ITGSAYKL VCYFTNWSQDRQEPGKFTPENI DPFLCSHL I YSFAS IENN 100 

65 KVIIKDKSEVMLYQTINSLKTKNPKLKILLSIGGYLFGSKGFHPMVDSST 114 

IIIM1IIIIIIIIM1IIIIMIIIIIIIIIIIIIMIIIIIIIIIIII 
101 KVIIKDKSEVMLYQTINSLKTKNPKLKILLSIGGYLFGSKGFHPMVDSST 150 

115 SRLEF I NS 1 1 LFLRNHNFDGLDVSW I YPDQKE NTHFTVL I HELAEAFQKD 164 

llllllllllllllljlllllllllillllllllllilllllllllllll 
151 SRLEFINSI ILFLRNHNFDGLDVSWI YPDQKENTHFTVL IHELAEAFQKD 200 

165 FTKSTKERLLLTAGVSAGRQMIDNSYQVEKLAKDLDFINLLSFDFHGSWE 214 

lllllllllllllltlllllllllllllllllllllllllllllllllll 
201 FTKSTKERLLLTAGVSAGRQMIDNSYQVEKLAKDLDFINLLSFDFHGSWE 250 

215 KPLITGHNSPLSKGWQDRGPSSYYNVEYAVGYWIHKGMPSEKVVMGIPTY 264 

llllllllllllllllllllllllllllllllllllllllllllllllll 
251 KPLITGHNSPLSKGWQDRGPSSYYNVEYAVGYWIHKGMPSEKVVMGIPTY 300 

265 GHSFTLASAETTVGAPASGPGAAGP ITESSGFLAYYE ICQFLKGAJCITRL 314 

llllllllltllllllltllllllllllllllllllllllllllllllll 
301 GHSFTLASAETTVGAPASGPGAAGPITESSGFLAYYEICQFLKGAKITRL 350 

315 QDQQVPYAVKGNQWVGYDDVKSMETKVQFLKNLNLGGAMIWSIDMDOFTG 364 

llllllllllllllllllllllllllllllllilllllllllltllllll 
351 QDQQVPYAVKGNQWVGYDDVKSMETKVQFLKNLNLGGAMIWSIDMDDFTG 400 

365 KSCNQGPYPLVQAVKRSLGSL . 385 

lllllllllllllllllllll 
401 KSCNQGPYPLVQAVKRSLGSL* 422 
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of any additional fee. 

3. | | As only some of the required additional -carch fees were timely paid by the applicant, this international search report cove 

only those claims for which fees were paid, specifically claims Nos.: 



4. | x] No required additional search fees were timely paid by the applicant. Consequently, this international search report : 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-15 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

[ ^ No protest accompanied the payment of additional search fees. 
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BOX 0. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA f und multiple inventions as f llows: 

This application contains the following inventions or groups of inventions which arc not so linked as to form a single 
inventive concept under PCT Rule 13.1- In order for all inventions to be examined, the appropriate additional 
examination fees must be paid. 

Group 1, claims 1-15, drawn to DNA, vectors, host cells and expression of human chitinase alpha and atpha-2. 
Group II, claims 16, 17 and 19, drawn to human chitinase alpha and atpha-2 protein and method of use. 
Group III, claim 18, drawn to human chitinase alpha and atpha-2 antibody. 

Group IV, claim 20, drawn to a method of treating a tissue remodelling disorder with a '.uman chitinase alpha or alpha- 
2 antagonist. 

The inventions listed as Groups MV do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: each of the 
four groups recites a chemically unrelated special technical feature. The special technical feature of Group I is a 
nucleic acid encoding human chitinase alpha or alpha-2, the special technical feature of Group II is a human chitinase 
alpha or aipha-2 protein, the special technical feature of Group III is a human chitinase alpha or atpha-2 antibody and 
the special technical feature of Group IV is a human chitinase alpha or alpha-2 antagonist. 
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